Jump to content

User:Emandems1/sandbox

From Wikipedia, the free encyclopedia


Introduction[edit]

Lattice proteins are highly simplified computer models of proteins which are used to investigate protein folding[1]. Because proteins are such large molecules, there are severe computational limits on the simulated timescales of their behavior when modeled in all-atom detail. The millisecond regime for all-atom simulations was not reached until 2010[2], and it is still impossible to fold all real proteins on a computer. The lattice protein model simplifies the protein folding problem in two ways: each whole residue (amino acid) is modeled as a single "bead" classified as one of a finite set of types (such as hydrophobic or polar), and each residue is placed on a vertex of a lattice. To guarantee the connectivity of the protein chain, adjacent residues on the backbone must be placed on adjacent vertices of the lattice. Steric constraints are expressed by imposing that no more than one residue can be placed on the same lattice vertex. Simplifications of this kind significantly reduce the computational effort in handling the model, although even in this simplified scenario the protein folding problem is NP-complete.[3]

Overview[edit]

Different versions of lattice proteins may adopt different types of lattice (typically square and triangular ones), in two or three dimensions, but it has been shown that generic lattices can be used and handled via a uniform approach.[4]

Lattice proteins are made to resemble real proteins by introducing an energy function, a set of conditions which specify the interaction energy between neighboring beads, usually those occupying adjacent lattice sites. The energy function mimics the interactions between amino acids in real proteins such as steric, hydrophobic and hydrogen bonding interactions. The beads are divided into types, and the energy function specifies the interactions depending on the bead type, just as different types of amino acids interact differently. One of the most popular lattice models, the HP model,[5] features just two bead types—hydrophobic (H) and polar (P)—and mimics the hydrophobic effect by specifying a favorable interaction between H beads.

For any sequence in any particular structure, an energy can be rapidly calculated from the energy function. For the simple HP model, this is simply an enumeration of all the contacts between H residues that are adjacent in the structure but not in the chain. Most researchers consider a lattice protein sequence protein-like only if it possesses a single structure with an energetic state lower than in any other structure. This is the energetic ground state, or native state. The relative positions of the beads in the native state constitute the lattice protein's tertiary structure. Lattice proteins do not have genuine secondary structure; however, some researchers have claimed that they can be extrapolated onto real protein structures which do include secondary structure, by appealing to the same law by which the phase diagrams of different substances can be scaled onto one another (the theorem of corresponding states).[6]

By varying the energy function and the bead sequence of the chain (the primary structure), effects on the native state structure and the kinetics of folding can be explored, and this may provide insights into the folding of real proteins. In particular, lattice models have been used to investigate the energy landscapes of proteins, i.e. the variation of their internal free energy as a function of conformation.

Lattices[edit]

A lattice is a set of orderly points that are connected by "edges"[4]. These points are called vertices and are connected to a certain number other vertices in the lattice by edges. The number of vertices each individual vertex is connected to is called the coordination number of the lattice, and it can be scaled up or down by changing the shape or dimension (2-dimensional to 3-dimensional, for example) of the lattice[4]. This number is important in shaping the characteristics of the lattice protein because it controls the number of other residues allowed to be adjacent to a given residue[4]. It has been shown that for most proteins the coordination number of the lattice used should fall between 3 and 20, although most commonly used lattices have coordination numbers at the lower end of this range[4].

Lattice shape is an important factor in the accuracy of lattice protein models. Changing lattice shape can dramatically alter the shape of the energetically favorable conformations[4]. It can also add unrealistic constraints to the protein structure such as in the case of the parity problem where in square and cubic lattices residues of the same parity (odd or even numbered) cannot make hydrophobic contact[7]. It has also been reported that triangular lattices yield more accurate structures than other lattice shapes when compared to crystallographic data[4]. To combat the parity problem, several researchers have suggested using triangular lattices when possible, as well as a square matrix with diagonals for theoretical applications where the square matrix may be more appropriate[7]. Hexagonal lattices with diagonals have also been suggested as a way to combat the parity problem[4].

Hydrophobic-Polar Model[edit]

Schematic showing a hypothetical polypeptide to demonstrate the main ideas of the hydrophobic-polar (HP) lattice protein folding model. Amino acid residues are represented as dots and colored yellow or green to represent polar or hydrophobic amino acids respectively.

The hydrophobic-polar protein model is the original lattice protein model and was first proposed by Dill et. al. in 1985 as a way to overcome the significant cost and difficulty of predicting protein structure, using only the hydrophobicity of the amino acids in the protein to predict the protein structure[7]. It is considered to be the paradigmatic lattice protein model[4]. The method was able to quickly give an estimate of protein structure by representing proteins as "short chains on a 2D square lattice" and has since become known as the hydrophobic-polar model. It breaks the protein folding problem into three separate problems: modeling the protein conformation, defining the energetic properties of the amino acids as they interact with one another to find said conformation, and developing an efficient algorithm for the prediction of these conformations. It is done by classifying amino acids in the protein as either hydrophobic or polar and assuming that the protein is being folded in an aqueous environment. The lattice statistical model seeks to recreate protein folding by minimizing the free energy of the contacts between hydrophobic amino acids. Hydrophobic amino acid residues are predicted to group around each other, while hydrophilic residues interact with the surrounding water[7].

Problems and Alternative Models[edit]

The simplicity of the hydrophobic polar model has caused it to have several problems that people have attempted to correct with alternative lattice protein models[7]. Chief among these problems is the issue of degeneracy, which is when there is more than one minimum energy conformation for the modeled protein, leading to uncertainty about which conformation is the native conformation. Attempts to address this include the HPNX model which classifies amino acids as hydrophobic (H), positive (P), negative (N), or neutral (X) according to the charge of the amino acid[8], adding additional parameters to reduce the number of low energy conformations and allows for more realistic protein simulations[7]. Another model is the Crippen model which uses protein characteristics taken from crystal structures to inform the choice of native conformation[9].

Another issue with lattice models is that they generally don't take into account the space taken up by amino acid side chains, instead considering only the α-carbon[4]. The side chain model addresses this by adding a side chain to the vertex adjacent to the α-carbon[10].

References[edit]

  1. ^ Lau, Kit Fun; Dill, Ken A. (1989). "A lattice statistical mechanics model of the conformational and sequence spaces of proteins". Macromolecules. 22 (10): 3986–3997. doi:10.1021/ma00200a030. ISSN 0024-9297.
  2. ^ Voelz, Vincent A.; Bowman, Gregory R.; Beauchamp, Kyle; Pande, Vijay S. (2010-02-10). "Molecular Simulation ofab InitioProtein Folding for a Millisecond Folder NTL9(1−39)". Journal of the American Chemical Society. 132 (5): 1526–1528. doi:10.1021/ja9090353. ISSN 0002-7863. PMC 2835335. PMID 20070076.{{cite journal}}: CS1 maint: PMC format (link)
  3. ^ B. Berger and T. Leighton (1998). "Protein folding in the hydrophobic-hydrophilic (HP) model is NP-complete". Journal of Computational Biology. 5 (1): 27–40. doi:10.1089/cmb.1998.5.27. PMID 9541869.
  4. ^ a b c d e f g h i j Bechini, A. (2013). "On the characterization and software implementation of general protein lattice models". PLoS ONE. 8 (3): e59504. doi:10.1371/journal.pone.0059504. PMC 3612044. PMID 23555684.{{cite journal}}: CS1 maint: unflagged free DOI (link)
  5. ^ Dill, K. A. (1985). "Theory for the folding and stability of globular proteins". Biochemistry. 24 (6): 1501–9. doi:10.1021/bi00327a032. PMID 3986190.
  6. ^ Onuchic, J. N.; Wolynes, P. G.; Luthey-Schulten, Z.; Socci, N.D. (1995). "Toward an outline of the topography of a realistic protein folding funnel". Proc. Natl. Acad. Sci. USA. 92 (8): 3626–30. doi:10.1073/pnas.92.8.3626. PMC 42220. PMID 7724609.
  7. ^ a b c d e f Dubey, Sandhya P.N.; Kini, N. Gopalakrishna; Balaji, S.; Kumar, M. Sathish (2018). "A Review of Protein Structure Prediction Using Lattice Model". Critical Reviews in Biomedical Engineering. 46 (2): 147–162. doi:10.1615/critrevbiomedeng.2018026093. ISSN 0278-940X.
  8. ^ Bornberg-Bauer, E.; Will, S.; Backofen, R. (1999-03-01). "Application of constraint programming techniques for structure prediction of lattice proteins with extended alphabets". Bioinformatics. 15 (3): 234–242. doi:10.1093/bioinformatics/15.3.234. ISSN 1367-4803.
  9. ^ Crippen, Gordon M. (1991-04-30). "Prediction of protein folding from amino acid sequence over discrete conformation spaces". Biochemistry. 30 (17): 4232–4237. doi:10.1021/bi00231a018. ISSN 0006-2960.
  10. ^ Dill, Ken; Bromberg, Sarina; Yue, Kaizhi; Chan, Hue Sun; Ftebig, Klaus; Yee, David; Thomas, Paul (April 1995). "Principles of protein folding — A perspective from simple exact models". Protein Science. 7 (4): 561–602. doi:10.1002/pro.5560040401.