Beta version
Chemical cross-linking of proteins or protein complexes and the mass spectrometry based localization of the cross-linked amino acids is a powerful method for generating distance restraints on the substrate’s topology. Xwalk was written to predict and validate these cross-links on existing protein structures. Xwalk calculates and displays non-linear distances between chemically cross-linked amino acids on protein surfaces, while mimicking the flexibility and non-linearity of cross-linker molecules. It returns a Solvent Accessible Surface Distance, which corresponds to the length of the shortest path between two amino acids, where the path leads through solvent occupied space without penetrating the protein surface.
Click here to read our Application Note in the Bioinformatics journal.
The inclusion of experimentally determined distance restraints in the computation of protein structures and complex topologies has become a key technique to increase the reliability of computational structure prediction 1,2. Chemical cross-links are a valuable source for such distance restraints 3,4. Of particular interest are cross-link modifications, where the cross-linker molecule covalently connects a pair of peptides. If both peptides originate from within the same protein chain the cross-link is referred to as intra-protein cross-link. In contrary, a cross-link between two peptides from distinct protein chains is called inter-protein cross-link 3,5. Other frequent modifications include mono-links and loop-links, where only one side of a bi-functional cross-linker molecule has reacted with the protein or twice with a single peptide, respectively. Xwalk does not distinguish between loop and intra-protein cross-links.You can get a list of all theoretically possible (virtual) intra or inter-protein cross-links by running Xwalk in Production Mode.
So far, in protein structure prediction distance restraints from cross-linking experiments have been employed as an upper limit on the Euclidean distance between a pair of cross-linked amino acids. With Xwalk you can too use the Euclidean distance to validate your cross-link data. However, be aware that Euclidean distance is not a precise measure for deducing the cross-linkability between two amino acids. The Euclidean distance metric being a standard L2 norm represents the length of the vector that connects two points (here any two atom centers of the cross-linked amino acids) in Cartesian space. However, such two points on the protein surface are likely separated by molecular slopes and depressions and form a physical barrier for a cross-linker molecule. The cross-linker molecule cannot penetrate these barriers but must circumvent them to bridge two reactive amino acids covalently together. Thus, a more precise representation of cross-linked amino acids on protein surfaces requires the incorporation of a non-linear distance measure that accommodates the flexibility of the cross-linker molecule and the cross-linked side chains.
Xwalk mimics the cross-linker molecule by calculating the shortest path between two amino acids on the protein surface while circumventing slopes and bridging depressions. Important thereby is that the path throughout its length does not penetrate the protein surface and only lead through solvent occupied space. We refer to the length of the shortest path as the Solvent-Accessible-Surface Distance (SASD).
SASD are calculated based on the following breadth first search algorithm, exemplary between ε-amino groups of lysine residues:
Please refer to the following reference when you cite Xwalk:
Kahraman A., Malmström L., Aebersold R. (2011). Xwalk: Computing and Visualizing Distances in Cross-linking Experiments. Bioinformatics, doi:10.1093/bioinformatics/btr348.