HHpred / HHsearch

HHsearch
Developer(s)	Johannes Söding
Stable release	1.5.0 / December 2008
Repository	github.com/soedinglab/hh-suite
Written in	C++
Available in	English
Type	Bioinformatics tool
License	Creative Commons Attribution-NonCommercial-2.0
Website	ftp://toolkit.lmb.uni-muenchen.de/hhsearch/

Revision as of 20:36, 14 April 2009

HHsearch is a program for protein sequence searching^[1] that is free for non-commercial use. HHpred is a free protein function and protein structure prediction server based on the HHsearch method.^[2] HHpred/HHsearch are among the most popular methods for protein structure prediction and the detection of remotely related sequences, having been cited over 340 times (Google Scholar search).

Sequence searches are frequently performed by biologists to infer the function of an unknown protein from its sequence. For this purpose, the protein's sequence is compared to the sequences of other proteins in public databases, and its function is deduced from those of the most similar sequences. Often, no sequences with annotated functions can be found in such a search. In this case, more sensitive methods are required to identify more remotely related proteins or protein families. From these relationships, hypotheses about the protein's function, structure, and domain composition can be inferred. HHsearch searches protein databases with a query protein sequence. The HHpred server and the HHsearch software package offer many popular, regularly updated databases, such as the Protein Data Bank (PDB), InterPro, Pfam, COG, and SCOP.

HHsearch belongs to the class of profile-profile comparison tools, which includes the most sensitive sequence search methods to date.^[3] ^[4] ^[5] ^[1] These tools represent both the query sequence and the database sequences by sequence profiles, also called position-specific scoring matrices (PSSMs). Profiles are calculated from a multiple sequence alignment of related sequences, which are typically collected using the PSI-BLAST program^[6] from NCBI. A profile is a matrix that contains, for each position in the query sequence, a similarity score for each of the 20 amino acids. These scores are calculated from the frequencies of the amino acids at the corresponding positions in the multiple sequence alignment. Because profiles contain much more information than a single sequence (e.g. the position-specific degree of conservation), profile-profile comparison methods are much more powerful than sequence-sequence comparison methods like BLAST or profile-sequence comparison methods like PSI-BLAST.^[3]
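
How a profile follows from the column frequencies of an alignment can be sketched in a few lines of Python. This is a minimal illustration and not the scoring scheme used by PSI-BLAST or HHsearch: it assumes a uniform background frequency of 1/20 for every amino acid, a simple additive pseudocount, and no sequence weighting. The function name `build_profile` and its parameters are hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_profile(alignment, pseudocount=1.0):
    """Return one dict of log-odds scores (amino acid -> score) per column.

    Assumptions (not taken from PSI-BLAST or HHsearch): all sequences have
    equal length, '-' marks a gap, every sequence has the same weight, and
    the background frequency of each amino acid is 1/20.
    """
    background = 1.0 / len(AMINO_ACIDS)
    profile = []
    for col in range(len(alignment[0])):
        # Count the amino acids observed in this column, skipping gaps.
        counts = Counter(
            seq[col] for seq in alignment if seq[col] in AMINO_ACIDS
        )
        total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
        # Score = log2 of (smoothed column frequency / background frequency).
        scores = {
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        }
        profile.append(scores)
    return profile


alignment = ["ACD", "ACE", "AC-"]
profile = build_profile(alignment)
```

In this toy alignment the first column is fully conserved, so `A` receives a positive score there and every other amino acid a negative one, which is the position-specific conservation signal that a single sequence cannot carry.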

HHpred represents query and database proteins by profile hidden Markov models (HMMs), an extension of sequence profiles that also record position-specific amino acid insertion and deletion frequencies. HHsearch searches a database of HMMs with a query HMM. Before starting the search through the actual database of HMMs, HHsearch/HHpred builds a multiple sequence alignment of related sequences using a context-specific version of PSI-BLAST (CSI-BLAST). From this alignment, a profile HMM is calculated. The databases contain HMMs that are precalculated in the same fashion using PSI-BLAST. The output of HHpred and HHsearch is a ranked list of database matches (including E-values and probabilities for a true relationship) and the pairwise query-database sequence alignments. A search through the PDB database of proteins with solved 3D structure takes a few minutes. If a significant match with a protein of known structure (a "template") is found in the PDB database, HHpred allows the user to build a homology model using the MODELLER software, starting from the pairwise query-template alignment.
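
The extra information a profile HMM carries beyond a plain profile, the position-specific insertion and deletion frequencies, can be illustrated with a rough Python sketch. It is a simplification and not the estimation procedure used by HHsearch: it assumes the first sequence of the alignment is the query, that `-` marks a gap, and that unweighted fractions of sequences stand in for the transition probabilities of a real profile HMM. The function name `gap_frequencies` is hypothetical.

```python
def gap_frequencies(alignment):
    """Return (deletions, insertions), one value per query position.

    deletions[i]  : fraction of sequences with a gap at query position i.
    insertions[i] : fraction of sequences with at least one residue inserted
                    between query position i and the next query position.
    Assumption: alignment[0] is the query and all rows have equal length.
    """
    query = alignment[0]
    n = len(alignment)
    deletions, insertions = [], []
    inserted = [False] * n  # per sequence: residue seen in current insert run
    for col, q in enumerate(query):
        if q == "-":
            # Column is an insertion relative to the query.
            for k, seq in enumerate(alignment):
                if seq[col] != "-":
                    inserted[k] = True
        else:
            if deletions:
                # Close the insert run that followed the previous position.
                insertions.append(sum(inserted) / n)
            inserted = [False] * n
            deletions.append(sum(seq[col] == "-" for seq in alignment) / n)
    if deletions:
        # Insert run after the last query position.
        insertions.append(sum(inserted) / n)
    return deletions, insertions


alignment = ["AC-GT", "A-TGT", "ACTG-"]
deletions, insertions = gap_frequencies(alignment)
```

Here the query has four positions; one of the three sequences lacks the second and one lacks the fourth, and two sequences carry an extra residue after the second position. A plain profile of amino acid scores would not record any of this.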

Applications of HHpred/HHsearch include protein structure prediction, function prediction, domain prediction, domain boundary prediction, and evolutionary classification of proteins. In the CASP7 benchmark experiment (see CASP - Critical Assessment of Techniques for Protein Structure Prediction), HHpred5 was ranked 2nd out of 68 automatic structure prediction servers, while being more than 50 times faster than the best 20 servers.^[7]

References

[1] Söding J (2005). "Protein homology detection by HMM-HMM comparison". Bioinformatics. 21 (7): 951–960. PMID 15531603.
[2] Söding J, Biegert A, Lupas AN (2005). "The HHpred interactive server for protein homology detection and structure prediction". Nucleic Acids Res. 33 (Web Server issue): W244–W248. PMID 15980461.
[3] Jaroszewski L, Rychlewski L, Godzik A (2000). "Improving the quality of twilight-zone alignments". Protein Sci. 9 (8): 1487–1496. PMID 10975570.
[4] Sadreyev RI, Baker D, Grishin NV (2003). "Profile-profile comparisons by COMPASS predict intricate homologies between protein families". Protein Sci. 12 (10): 2262–2272. PMID 14500884.
[5] Dunbrack RL Jr. (2006). "Sequence comparison and protein structure prediction". Curr Opin Struct Biol. 16 (3): 374–384. PMID 16713709.
[6] Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990). "Basic local alignment search tool". J Mol Biol. 215 (3): 403–410. PMID 2231712.
[7] Battey JN, Kopp J, Bordoli L, Read RJ, Clarke ND, Schwede T (2007). "Automated server predictions in CASP7". Proteins. 69 (Suppl 8): 68–82. PMID 17894354.

External links

http://toolkit.lmb.uni-muenchen.de/hhpred (free server at University of Munich (LMU))
http://toolkit.tuebingen.mpg.de/hhpred (free server at Max-Planck Institute in Tuebingen)
CASP website


See also

BLAST (Basic Local Alignment Search Tool)
Context-specific BLAST (CS-BLAST)