Kabsch algorithm

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

The Kabsch algorithm, named after Wolfgang Kabsch, is a method for calculating the optimal rotation matrix that minimizes the RMSD (root mean squared deviation) between two paired sets of points. It is useful in graphics, cheminformatics to compare molecular structures, and also bioinformatics for comparing protein structures (in particular, see root-mean-square deviation (bioinformatics)).

The algorithm only computes the rotation matrix, but it also requires the computation of a translation vector. When both the translation and rotation are actually performed, the algorithm is sometimes called partial Procrustes superimposition (see also orthogonal Procrustes problem).

Description[edit]

The algorithm for the rotation of P unto Q starts with two sets of paired points, P and Q. Each set of points can be represented as an N × 3 matrix. The first row is the coordinates of the first point, the second row is the coordinates of the second point, the Nth row is the coordinates of the Nth point.

The algorithm works in three steps: a translation, the computation of a covariance matrix, and the computation of the optimal rotation matrix.

Translation[edit]

Both sets of coordinates must be translated first, so that their centroid coincides with the origin of the coordinate system. This is done by subtracting from the point coordinates the coordinates of the respective centroid.

Computation of the covariance matrix[edit]

The second step consists of calculating a cross-covariance matrix H. In matrix notation,

or, using summation notation,

Computation of the optimal rotation matrix[edit]

It is possible to calculate the optimal rotation R based on the matrix formula

but implementing a numerical solution to this formula becomes complicated when all special cases are accounted for (for example, the case of H not having an inverse).

If singular value decomposition (SVD) routines are available, the optimal rotation, R, can be calculated using the following simple algorithm.

First, calculate the SVD of the covariance matrix H.

Next, decide whether we need to correct our rotation matrix to ensure a right-handed coordinate system

Finally, calculate our optimal rotation matrix, R, as

The optimal rotation matrix can also be expressed in terms of quaternions.[1][2][3][4] This alternative description was recently used in the development of a rigorous method for removing rigid-body motions from molecular dynamics trajectories of flexible molecules.[5] In 2002 a generalization for the application to probability distributions (continuous or not) was also proposed.[6]

Generalizations[edit]

The algorithm was described for points in a three-dimensional space. The generalization to D dimensions is immediate.

External links[edit]

This SVD algorithm is described in more detail at http://cnx.org/content/m11608/latest/

A Matlab function is available at http://www.mathworks.com/matlabcentral/fileexchange/25746-kabsch-algorithm

A C++ implementation (and unit test) using Eigen

A Python script is available at https://github.com/charnley/rmsd

A free PyMol plugin easily implementing Kabsch is Cealign. VMD uses the Kabsch algorithm for its alignment.

See also[edit]

References[edit]

  1. ^ Horn, Berthold K. P. (1987-04-01). "Closed-form solution of absolute orientation using unit quaternions". Journal of the Optical Society of America A. 4 (4). doi:10.1364/josaa.4.000629. ISSN 1520-8532. 
  2. ^ Kneller, Gerald R. (1991-05-01). "Superposition of Molecular Structures using Quaternions". Molecular Simulation. 7 (1–2): 113–119. doi:10.1080/08927029108022453. ISSN 0892-7022. 
  3. ^ Coutsias, E. A.; Seok, C.; Dill, K. A. (2004). "Using quaternions to calculate RMSD". J. Comput. Chem. 25 (15): 1849–1857. doi:10.1002/jcc.20110. PMID 15376254. 
  4. ^ Petitjean, M. (1999). "On the root mean square quantitative chirality and quantitative symmetry measures". J. Math. Phys. 40 (9): 4587–4595. doi:10.1063/1.532988. 
  5. ^ "Least constraint approach to the extraction of internal motions from molecular dynamics trajectories of flexible macromolecules". J. Chem. Phys. 135 (8): 084110. 2011-08-24. doi:10.1063/1.3626275. ISSN 0021-9606. 
  6. ^ Petitjean, M. (2002). "Chiral mixtures". J. Math. Phys. 43 (8): 4147–4157. doi:10.1063/1.1484559. 
  • Kabsch, Wolfgang (1976). "A solution for the best rotation to relate two sets of vectors". Acta Crystallographica. A32: 922. doi:10.1107/S0567739476001873. 
    • With a correction in Kabsch, Wolfgang (1978). "A discussion of the solution for the best rotation to relate two sets of vectors". Acta Crystallographica. A34: 827–828. doi:10.1107/S0567739478001680. 
  • Lin, Ying-Hung; Chang, Hsun-Chang; Lin, Yaw-Ling (December 15–17, 2004). A Study on Tools and Algorithms for 3-D Protein Structures Alignment and Comparison. International Computer Symposium. Taipei, Taiwan. 
  • Umeyama, Shinj (1991). "Least-Squares Estimation of Transformation Parameters Between Two Point Patterns". IEEE Trans. Pattern Anal. Mach. Intell. 13 (4): 376–380. doi:10.1109/34.88573.