Nuclear magnetic resonance chemical shift re-referencing: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Anchiguo (talk | contribs)
No edit summary
Anchiguo (talk | contribs)
No edit summary
Line 2: Line 2:
==Importance of NMR chemical shift re-referencing in biomolecular NMR==
==Importance of NMR chemical shift re-referencing in biomolecular NMR==


Incorrect chemical shift referencing is a particularly acute problem in biomolecular NMR <ref name=pmid8589602>{{cite journal |title=1H, 13C and 15N chemical shift referencing in biomolecular NMR.|last=Wishart|first=DS|coauthors=Bigam CG, Yao J, Abildgaard F, Dyson HJ, Oldfield E, Markley JL, Sykes BD|journal=Journal of Biomolecular NMR|date=1995|volume=6|page=135-40|pmid= 8589602}}</ref>[17]. It has been estimated that up to 20% of 13C and up to 35% of 15N shift assignments are improperly referenced <ref name=pmid12652131>{{cite journal |title=RefDB: A database of uniformly referenced protein chemical shifts|last=Zhang|first=H|coauthors=Neal, S. and Wishart, D.S.|journal=J. Biomol. NMR|date=Mar 2003|volume=25|page=173-195|doi=10.1023/A:1022836027055|pmid=12652131}}</ref>
Incorrect chemical shift referencing is a particularly acute problem in biomolecular NMR <ref name=pmid8589602>{{cite journal |title=1H, 13C and 15N chemical shift referencing in biomolecular NMR.|last=Wishart|first=DS|coauthors=Bigam CG, Yao J, Abildgaard F, Dyson HJ, Oldfield E, Markley JL, Sykes BD|journal=Journal of Biomolecular NMR|date=1995|volume=6|page=135-40|pmid= 8589602}}</ref>. It has been estimated that up to 20% of 13C and up to 35% of 15N shift assignments are improperly referenced <ref name=pmid12652131>{{cite journal |title=RefDB: A database of uniformly referenced protein chemical shifts|last=Zhang|first=H|coauthors=Neal, S. and Wishart, D.S.|journal=J. Biomol. NMR|date=Mar 2003|volume=25|page=173-195|doi=10.1023/A:1022836027055|pmid=12652131}}</ref>
<ref name=pmid11460554>{{cite journal |title=Use of chemical shifts in macromolecular structure determination.|last=Wishart|first=DS|coauthors=Case DA|journal=Methods in Enzymology|date=2001|volume=338|page=3-34|pmid= 11460554}}</ref>
<ref name=pmid11460554>{{cite journal |title=Use of chemical shifts in macromolecular structure determination.|last=Wishart|first=DS|coauthors=Case DA|journal=Methods in Enzymology|date=2001|volume=338|page=3-34|pmid= 11460554}}</ref>
<ref name=pmid19436955>
<ref name=pmid19436955>
{{cite journal |title= Empirical correlation between protein backbone 15N and 13C secondary chemical shifts and its application to nitrogen chemical shift re-referencing.|last=Wang|first=L| journal=Journal of Biomolecular NMR.|date=2009|volume=44|page=95-99|pmid= 19436955}}</ref>
{{cite journal |title= Empirical correlation between protein backbone 15N and 13C secondary chemical shifts and its application to nitrogen chemical shift re-referencing.|last=Wang|first=L| journal=Journal of Biomolecular NMR.|date=2009|volume=44|page=95-99|pmid= 19436955}}</ref>. Given that the structural and dynamic information contained within chemical shifts is often quite subtle, it is critical that protein chemical shifts be properly referenced so that these subtle differences can be detected. Fundamentally, the problem with chemical shift referencing comes from the fact that chemical shifts are relative frequency measurements rather than absolute frequency measurements. Because of the historic problems with chemical shift referencing, chemical shifts are perhaps the most precisely measurable but the least accurately measured parameters in all of NMR spectroscopy <ref name= pmid21241884>{{cite journal |title=Interpreting protein chemical shift data.|last=Wishart|first=DS|journal=Progress in Nuclear Magnetic Resonance Spectroscopy|date=Feb 2011|volume=85|page=62-87|pmid=21241884}}</ref>
<ref name=pmid11460554>{{cite journal |title=Use of chemical shifts in macromolecular structure determination.|last=Wishart|first=DS|coauthors=Case DA|journal=Methods in Enzymology|date=2001|volume=338|page=3-34|pmid= 11460554}}</ref>.
[2,4,11]. Given that the structural and dynamic information contained within chemical shifts is often quite subtle, it is critical that protein chemical shifts be properly referenced so that these subtle differences can be detected. Fundamentally, the problem with chemical shift referencing comes from the fact that chemical shifts are relative frequency measurements rather than absolute frequency measurements. Because of the historic problems with chemical shift referencing, chemical shifts are perhaps the most precisely measurable but the least accurately measured parameters in all of NMR spectroscopy <ref name= pmid21241884>{{cite journal |title=Interpreting protein chemical shift data.|last=Wishart|first=DS|journal=Progress in Nuclear Magnetic Resonance Spectroscopy|date=Feb 2011|volume=85|page=62-87|pmid=21241884}}</ref>
<ref name=pmid11460554>{{cite journal |title=Use of chemical shifts in macromolecular structure determination.|last=Wishart|first=DS|coauthors=Case DA|journal=Methods in Enzymology|date=2001|volume=338|page=3-34|pmid= 11460554}}</ref>[1, 4].


==Programs for protein chemical shift re-referencing==
==Programs for protein chemical shift re-referencing==


Because of the magnitude and severity of the problems with chemical shift referencing in biomolecular NMR, a number of computer programs have been developed to help mitigate the problem (see Table 1 for a summary). The first program to comprehensively tackle chemical shift mis-referencing in biomolecular NMR was SHIFTCOR <ref name=pmid12652131>{{cite journal |title=RefDB: A database of uniformly referenced protein chemical shifts|last=Zhang|first=H|coauthors=Neal, S. and Wishart, D.S.|journal=J. Biomol. NMR|date=Mar 2003|volume=25|page=173-195|doi=10.1023/A:1022836027055|pmid=12652131}}</ref>[2].
Because of the magnitude and severity of the problems with chemical shift referencing in biomolecular NMR, a number of computer programs have been developed to help mitigate the problem (see Table 1 for a summary). The first program to comprehensively tackle chemical shift mis-referencing in biomolecular NMR was SHIFTCOR <ref name=pmid12652131>{{cite journal |title=RefDB: A database of uniformly referenced protein chemical shifts|last=Zhang|first=H|coauthors=Neal, S. and Wishart, D.S.|journal=J. Biomol. NMR|date=Mar 2003|volume=25|page=173-195|doi=10.1023/A:1022836027055|pmid=12652131}}</ref>.




Table 1. Summary and comparison of different chemical shift re-referencing and mis-assignment detection programs <ref name= pmid21241884>{{cite journal |title=Interpreting protein chemical shift data.|last=Wishart|first=DS|journal=Progress in Nuclear Magnetic Resonance Spectroscopy|date=Feb 2011|volume=85|page=62-87|pmid=21241884}}</ref>.
Table 1. Summary and comparison of different chemical shift re-referencing and mis-assignment detection programs <ref name= pmid21241884>{{cite journal |title=Interpreting protein chemical shift data.|last=Wishart|first=DS|journal=Progress in Nuclear Magnetic Resonance Spectroscopy|date=Feb 2011|volume=85|page=62-87|pmid=21241884}}</ref>.
{| class="wikitable"
{| class="wikitable"
! scope="col" | Program [Reference] [URL]
! scope="col" | Program [Reference]
! scope="col" | Detects or performs shift re-referencing
! scope="col" | Detects or performs shift re-referencing
! scope="col" | Detects gross assignment errors
! scope="col" | Detects gross assignment errors
Line 38: Line 37:
==SHIFTCOR: A structure-based chemical shift correction program==
==SHIFTCOR: A structure-based chemical shift correction program==


SHIFTCOR is an automated protein chemical shift correction program that uses statistical methods to compare and correct predicted NMR chemical shifts (derived from the 3D structure of the protein) relative to an input set of experimentally measured chemical shifts. SHIFTCOR uses several simple statistical approaches and pre-determined cut-off values to identify and correct potential referencing, assignment and typographical errors. SHIFTCOR identifies potential chemical shift referencing problems by comparing the difference between the average value of each set of observed backbone (1Hα, 13Cα, 13Cβ, 13CO, 15N and 1HN) shifts and their corresponding predicted chemical shifts. The difference between these two averages results in a nucleus-specific chemical shift offset or reference correction (i.e. one for 1H, one for 13C and one for 15N). In order to ensure that certain extreme outliers do not unduly bias these average offset values, the average of the observed shifts is only calculated after excluding potential mis-assignments or typographical errors <ref name=pmid12652131>{{cite journal |title=RefDB: A database of uniformly referenced protein chemical shifts|last=Zhang|first=H|coauthors=Neal, S. and Wishart, D.S.|journal=J. Biomol. NMR|date=Mar 2003|volume=25|page=173-195|doi=10.1023/A:1022836027055|pmid=12652131}}</ref>[2].
SHIFTCOR is an automated protein chemical shift correction program that uses statistical methods to compare and correct predicted NMR chemical shifts (derived from the 3D structure of the protein) relative to an input set of experimentally measured chemical shifts. SHIFTCOR uses several simple statistical approaches and pre-determined cut-off values to identify and correct potential referencing, assignment and typographical errors. SHIFTCOR identifies potential chemical shift referencing problems by comparing the difference between the average value of each set of observed backbone (1Hα, 13Cα, 13Cβ, 13CO, 15N and 1HN) shifts and their corresponding predicted chemical shifts. The difference between these two averages results in a nucleus-specific chemical shift offset or reference correction (i.e. one for 1H, one for 13C and one for 15N). In order to ensure that certain extreme outliers do not unduly bias these average offset values, the average of the observed shifts is only calculated after excluding potential mis-assignments or typographical errors <ref name=pmid12652131>{{cite journal |title=RefDB: A database of uniformly referenced protein chemical shifts|last=Zhang|first=H|coauthors=Neal, S. and Wishart, D.S.|journal=J. Biomol. NMR|date=Mar 2003|volume=25|page=173-195|doi=10.1023/A:1022836027055|pmid=12652131}}</ref>.


==SHIFTCOR output:==
==SHIFTCOR output:==
[[File:SHIFTCOR fig1.png|thumb|Figure 1. An example SHIFTCOR output for bmr4375.str]]
[[File:SHIFTCOR fig1.png|thumb|Figure 1. An example SHIFTCOR output for bmr4375.str]]
SHIFTCOR generates and reports chemical shift offsets or differences for each nucleus. The results contain the chemical shift analyses (including lists of potential mis-assignments, the estimated referencing errors, the estimated error in the calculated reference offset (95% confidence interval), the applied or suggested reference offset, correlation coefficients, RMSD values) and the corrected BMRB formatted chemical shift file (see Figure 1 for details) <ref name=pmid12652131>{{cite journal |title=RefDB: A database of uniformly referenced protein chemical shifts|last=Zhang|first=H|coauthors=Neal, S. and Wishart, D.S.|journal=J. Biomol. NMR|date=Mar 2003|volume=25|page=173-195|doi=10.1023/A:1022836027055|pmid=12652131}}</ref>[2].
SHIFTCOR generates and reports chemical shift offsets or differences for each nucleus. The results contain the chemical shift analyses (including lists of potential mis-assignments, the estimated referencing errors, the estimated error in the calculated reference offset (95% confidence interval), the applied or suggested reference offset, correlation coefficients, RMSD values) and the corrected BMRB formatted chemical shift file (see Figure 1 for details) <ref name=pmid12652131>{{cite journal |title=RefDB: A database of uniformly referenced protein chemical shifts|last=Zhang|first=H|coauthors=Neal, S. and Wishart, D.S.|journal=J. Biomol. NMR|date=Mar 2003|volume=25|page=173-195|doi=10.1023/A:1022836027055|pmid=12652131}}</ref>.


SHIFTCOR uses the chemical shift calculation program SHIFTX <ref name=pmid12766419>{{cite journal |title=Rapid and accurate calculation of protein 1H, 13C and 15N chemical shifts.|last=Neal|first=S|coauthors=Nip AM, Zhang H, Wishart DS|journal=Journal of Biomolecular NMR|date=Jul 2003|volume=26|page=215-240|doi=10.1023/A:1023812930288|pmid= 12766419}}</ref>[19] to predict 1Hα, 13Cα,15N shifts based on the 3D structure coordinates of the protein being analyzed. By comparing the predicted shifts to the observed shifts, SHIFTCOR is able to accurately identify chemical shift reference offsets as well as potential mis-assignments. A key limitation to the SHIFTCOR approach is that requires that the 3D structure for the target protein be available to assess the chemical shift reference offsets. Given that chemical shift assignments are typically made before the structure is determined, it was soon realized that structure-independent approaches were required to develop <ref name= pmid21241884>{{cite journal |title=Interpreting protein chemical shift data.|last=Wishart|first=DS|journal=Progress in Nuclear Magnetic Resonance Spectroscopy|date=Feb 2011|volume=85|page=62-87|pmid=21241884}}</ref>[1].
SHIFTCOR uses the chemical shift calculation program SHIFTX <ref name=pmid12766419>{{cite journal |title=Rapid and accurate calculation of protein 1H, 13C and 15N chemical shifts.|last=Neal|first=S|coauthors=Nip AM, Zhang H, Wishart DS|journal=Journal of Biomolecular NMR|date=Jul 2003|volume=26|page=215-240|doi=10.1023/A:1023812930288|pmid= 12766419}}</ref> to predict 1Hα, 13Cα,15N shifts based on the 3D structure coordinates of the protein being analyzed. By comparing the predicted shifts to the observed shifts, SHIFTCOR is able to accurately identify chemical shift reference offsets as well as potential mis-assignments. A key limitation to the SHIFTCOR approach is that requires that the 3D structure for the target protein be available to assess the chemical shift reference offsets. Given that chemical shift assignments are typically made before the structure is determined, it was soon realized that structure-independent approaches were required to develop <ref name= pmid21241884>{{cite journal |title=Interpreting protein chemical shift data.|last=Wishart|first=DS|journal=Progress in Nuclear Magnetic Resonance Spectroscopy|date=Feb 2011|volume=85|page=62-87|pmid=21241884}}</ref>.


==Structure-independent chemical shift correction programs==
==Structure-independent chemical shift correction programs==


Several methods have been developed that make use of the estimated (via 1H or 13C shifts) or predicted (via sequence) secondary structure content of the protein being analyzed. These programs include PSSI <ref name=pmid15772753>{{cite journal |title= A simple method to adjust inconsistently referenced 13C and 15N chemical shift assignments of proteins.|last= Wang |first=Y|coauthors= Wishart DS |journal=Journal of Biomolecular NMR.|date=2005|volume=31|page=143-148|pmid=15772753}}</ref>[12], CheckShift <ref name=pmid17899394>{{cite journal |title= CheckShift: automatic correction of inconsistent chemical shift referencing.|last= Ginzinger |first=SW|coauthors= Gerick F, Coles M, Heun V|journal=Journal of Biomolecular NMR.|date=2007|volume=39|page=223-227|pmid=17899394}}</ref>[13], <ref name=pmid19575298>{{cite journal |title= CheckShift improved: fast chemical shift reference correction with high accuracy.|last= Ginzinger |first=SW|coauthors= Skocibusić M, Heun V |journal=Journal of Biomolecular NMR.|date=2009|volume=44|page=207-211|pmid=19575298}}</ref>[14], LACS <ref name=pmid16041479>{{cite journal |title=Linear analysis of carbon-13 chemical shift differences and its application to the detection and correction of errors in referencing and spin system identifications.|last=Wang|first=L|coauthors=Eghbalnia HR, Bahrami A, Markley JL|journal=Journal of Biomolecular NMR.|date=May 2005|volume=32|page=13-22|pmid= 16041479}}</ref>[10], <ref name=pmid19436955>{{cite journal |title= Empirical correlation between protein backbone 15N and 13C secondary chemical shifts and its application to nitrogen chemical shift re-referencing.|last=Wang|first=L| journal=Journal of Biomolecular NMR.|date=2009|volume=44|page=95-99|pmid= 19436955}}</ref>[11] and PANAV <ref name=pmid20446018>{{cite journal |title= A probabilistic approach for validating protein NMR chemical shift assignments.|last=Wang|first=B|coauthors=Wang Y|journal=Journal of Biomolecular NMR.|date=2010|volume=47|page=85-99|pmid=20446018}}</ref>[15]. Both PANAV (http://www.wishartlab.com/web_servers/panav) and CheckShift (http://checkshift.services.came.sbg.ac.at/) are also available as web servers.
Several methods have been developed that make use of the estimated (via 1H or 13C shifts) or predicted (via sequence) secondary structure content of the protein being analyzed. These programs include PSSI <ref name=pmid15772753>{{cite journal |title= A simple method to adjust inconsistently referenced 13C and 15N chemical shift assignments of proteins.|last= Wang |first=Y|coauthors= Wishart DS |journal=Journal of Biomolecular NMR.|date=2005|volume=31|page=143-148|pmid=15772753}}</ref>, CheckShift <ref name=pmid17899394>{{cite journal |title= CheckShift: automatic correction of inconsistent chemical shift referencing.|last= Ginzinger |first=SW|coauthors= Gerick F, Coles M, Heun V|journal=Journal of Biomolecular NMR.|date=2007|volume=39|page=223-227|pmid=17899394}}</ref>, <ref name=pmid19575298>{{cite journal |title= CheckShift improved: fast chemical shift reference correction with high accuracy.|last= Ginzinger |first=SW|coauthors= Skocibusić M, Heun V |journal=Journal of Biomolecular NMR.|date=2009|volume=44|page=207-211|pmid=19575298}}</ref>, LACS <ref name=pmid16041479>{{cite journal |title=Linear analysis of carbon-13 chemical shift differences and its application to the detection and correction of errors in referencing and spin system identifications.|last=Wang|first=L|coauthors=Eghbalnia HR, Bahrami A, Markley JL|journal=Journal of Biomolecular NMR.|date=May 2005|volume=32|page=13-22|pmid= 16041479}}</ref>, <ref name=pmid19436955>{{cite journal |title= Empirical correlation between protein backbone 15N and 13C secondary chemical shifts and its application to nitrogen chemical shift re-referencing.|last=Wang|first=L| journal=Journal of Biomolecular NMR.|date=2009|volume=44|page=95-99|pmid= 19436955}}</ref> and PANAV <ref name=pmid20446018>{{cite journal |title= A probabilistic approach for validating protein NMR chemical shift assignments.|last=Wang|first=B|coauthors=Wang Y|journal=Journal of Biomolecular NMR.|date=2010|volume=47|page=85-99|pmid=20446018}}</ref>. Both PANAV (http://www.wishartlab.com/web_servers/panav) and CheckShift (http://checkshift.services.came.sbg.ac.at/) are also available as web servers.


The PSSI and PANAV programs use the secondary structure determined by 1H shifts (which are almost never mis-referenced) to adjust the target protein’s 13C and 15N shifts to match the 1H-derived secondary structure. LACS uses the difference between secondary 13Cα and 13Cβ shifts plotted against secondary 13Cα shifts or secondary 13Cβ shifts to determine reference offsets. A more recent version of LACS [68] has been adapted to identify 15N chemical shift mis-referencing. This new version of LACS exploits the well-known relationship between 15N shifts and the 13Cα (or 13Cβ shifts of the preceding residue <ref name=pmid11460554></ref>[4]. In contrast to LACS and PANAV/PSSI, CheckShift uses secondary structure predicted from high-performance secondary structure prediction programs such as PSIPRED <ref name=pmid10869041>{{cite journal |title= The PSIPRED protein structure prediction server.|last= McGuffin|first=LJ|coauthors= Bryson K, Jones DT|journal= Bioinformatics|date=2000|volume=16|page=404–405|pmid= 10869041}}</ref>[16] to iteratively adjust 13C and 15N chemical shifts so that their secondary shifts match the predicted secondary structure. These programs have all been shown to accurately identify mis-referenced and properly re-reference protein chemical shifts deposited in the BMRB <ref name=pmid19575298></ref>[14], <ref name=pmid20446018></ref>[15]. Note that both LACS and CheckShift are programmed to always predict the same offset for 13Cα and 13Cβ shifts, whereas PSSI and PANAV do not make this assumption. As a general rule, PANAV and PSSI typically exhibit a smaller spread (or standard deviation) in calculated reference offsets, indicating that these programs are slightly more precise than either LACS or CheckShift. Interestingly, neither LACS nor CheckShift are able to handle proteins that have the extremely large (above 40 ppm) reference offsets, whereas PANAV and PSSI seem to be able to deal with these kinds of anomalous proteins <ref name=pmid20446018></ref>[15].
The PSSI and PANAV programs use the secondary structure determined by 1H shifts (which are almost never mis-referenced) to adjust the target protein’s 13C and 15N shifts to match the 1H-derived secondary structure. LACS uses the difference between secondary 13Cα and 13Cβ shifts plotted against secondary 13Cα shifts or secondary 13Cβ shifts to determine reference offsets. A more recent version of LACS [68] has been adapted to identify 15N chemical shift mis-referencing. This new version of LACS exploits the well-known relationship between 15N shifts and the 13Cα (or 13Cβ shifts of the preceding residue <ref name=pmid11460554></ref>. In contrast to LACS and PANAV/PSSI, CheckShift uses secondary structure predicted from high-performance secondary structure prediction programs such as PSIPRED <ref name=pmid10869041>{{cite journal |title= The PSIPRED protein structure prediction server.|last= McGuffin|first=LJ|coauthors= Bryson K, Jones DT|journal= Bioinformatics|date=2000|volume=16|page=404–405|pmid= 10869041}}</ref> to iteratively adjust 13C and 15N chemical shifts so that their secondary shifts match the predicted secondary structure. These programs have all been shown to accurately identify mis-referenced and properly re-reference protein chemical shifts deposited in the BMRB <ref name=pmid19575298></ref>, <ref name=pmid20446018></ref>. Note that both LACS and CheckShift are programmed to always predict the same offset for 13Cα and 13Cβ shifts, whereas PSSI and PANAV do not make this assumption. As a general rule, PANAV and PSSI typically exhibit a smaller spread (or standard deviation) in calculated reference offsets, indicating that these programs are slightly more precise than either LACS or CheckShift. Interestingly, neither LACS nor CheckShift are able to handle proteins that have the extremely large (above 40 ppm) reference offsets, whereas PANAV and PSSI seem to be able to deal with these kinds of anomalous proteins <ref name=pmid20446018></ref>.


In a recent study <ref name=pmid20446018></ref>[15], a chemical shift re-referencing program (PANAV) was run on a total of 2421 BMRB entries that had a sufficient proportion of (>80%) of assigned chemical shifts to perform a robust chemical shift reference correction. A total of 243 entries were found with 13Cα shifts offset by more than 1.0 ppm, 238 entries with 13Cβ shifts offset of more than 1.0 ppm, 200 entries with 13C’ shifts offset of more than 1.0 ppm and 137 entries with 15N shifts offset by more than 1.5 ppm. From this study, 19.7% of the entries in the BMRB appear to be mis-referenced. Evidently, chemical shift referencing continues to be a significant, and as yet unresolved problem for the biomolecular NMR community <ref name= pmid21241884>{{cite journal |title=Interpreting protein chemical shift data.|last=Wishart|first=DS|journal=Progress in Nuclear Magnetic Resonance Spectroscopy|date=Feb 2011|volume=85|page=62-87|pmid=21241884}}</ref><ref name=pmid20446018></ref>[1,15].
In a recent study <ref name=pmid20446018></ref>, a chemical shift re-referencing program (PANAV) was run on a total of 2421 BMRB entries that had a sufficient proportion of (>80%) of assigned chemical shifts to perform a robust chemical shift reference correction. A total of 243 entries were found with 13Cα shifts offset by more than 1.0 ppm, 238 entries with 13Cβ shifts offset of more than 1.0 ppm, 200 entries with 13C’ shifts offset of more than 1.0 ppm and 137 entries with 15N shifts offset by more than 1.5 ppm. From this study, 19.7% of the entries in the BMRB appear to be mis-referenced. Evidently, chemical shift referencing continues to be a significant, and as yet unresolved problem for the biomolecular NMR community <ref name= pmid21241884>{{cite journal |title=Interpreting protein chemical shift data.|last=Wishart|first=DS|journal=Progress in Nuclear Magnetic Resonance Spectroscopy|date=Feb 2011|volume=85|page=62-87|pmid=21241884}}</ref><ref name=pmid20446018></ref>.


==See also==
==See also==
Line 67: Line 66:
==References==
==References==
{{reflist}}
{{reflist}}

==General References==
*{{cite journal |title= The chemical shift index - a fast and simple method for the assignment of protein secondary structure through NMR spectroscopy.|last= Wishart|first=DS|coauthors= Sykes BD, Richards FM|journal= Biochemistry|date=1992|volume=31|page=1647–1651|pmid=1737021}}
*{{cite journal |title= The C-13 chemical-shift index - a simple method for the identification of protein secondary structure using C-13 chemical-shift data.|last= Wishart|first=DS|coauthors= Sykes B |journal= Journal of Biomolecular NMR |date=1994|volume=4|page=171–180|pmid=8019132}}
*{{cite journal |title= Analysis of proton chemical shifts in regular secondary structure of proteins.|last= Osapay|first=K|coauthors= Case DA|journal=Journal of Biomolecular NMR.|date=1994|volume=4|page=215-230|pmid=8019135}}
*{{cite journal |title= Identification of N-terminal helix capping boxes by means of 13C chemical shifts.|last= Gronenborn|first=AM|coauthors= Clore GM|journal=Journal of Biomolecular NMR.|date=1994|volume=4|page=455-458|pmid=8019146}}
*{{cite journal |title= Assignment validation software suite for the evaluation and presentation of protein resonance assignment data.|last= Moseley|first=HN|coauthors= Sahota G, Montelione GT|journal=Journal of Biomolecular NMR.|date=2004|volume=28|page=341-355|pmid=14872126}}

<!--- After listing your sources please cite them using inline citations and place them after the information they cite. Please see http://en.wikipedia.org/wiki/Wikipedia:REFB for instructions on how to add citations. --->
<!--- After listing your sources please cite them using inline citations and place them after the information they cite. Please see http://en.wikipedia.org/wiki/Wikipedia:REFB for instructions on how to add citations. --->
*
*

Revision as of 18:12, 25 March 2014

Importance of NMR chemical shift re-referencing in biomolecular NMR

Incorrect chemical shift referencing is a particularly acute problem in biomolecular NMR [1]. It has been estimated that up to 20% of 13C and up to 35% of 15N shift assignments are improperly referenced [2] [3] [4]. Given that the structural and dynamic information contained within chemical shifts is often quite subtle, it is critical that protein chemical shifts be properly referenced so that these subtle differences can be detected. Fundamentally, the problem with chemical shift referencing comes from the fact that chemical shifts are relative frequency measurements rather than absolute frequency measurements. Because of the historic problems with chemical shift referencing, chemical shifts are perhaps the most precisely measurable but the least accurately measured parameters in all of NMR spectroscopy [5] [3].

Programs for protein chemical shift re-referencing

Because of the magnitude and severity of the problems with chemical shift referencing in biomolecular NMR, a number of computer programs have been developed to help mitigate the problem (see Table 1 for a summary). The first program to comprehensively tackle chemical shift mis-referencing in biomolecular NMR was SHIFTCOR [2].


Table 1. Summary and comparison of different chemical shift re-referencing and mis-assignment detection programs [5].

Program [Reference] Detects or performs shift re-referencing Detects gross assignment errors Detects subtle assignment errors Distinguishes assignment errors from referencing errors Requires 3D structure
CheckShift [6][7] Yes No No No No
AVS[8] No Yes No No No
LACS [9][4] Yes No No No No
PSSI [10] Yes No No No No
SHIFTCOR [2] Yes Yes Sometimes Yes Yes
PANAV [11] Yes Yes Yes Yes Yes

SHIFTCOR: A structure-based chemical shift correction program

SHIFTCOR is an automated protein chemical shift correction program that uses statistical methods to compare and correct predicted NMR chemical shifts (derived from the 3D structure of the protein) relative to an input set of experimentally measured chemical shifts. SHIFTCOR uses several simple statistical approaches and pre-determined cut-off values to identify and correct potential referencing, assignment and typographical errors. SHIFTCOR identifies potential chemical shift referencing problems by comparing the difference between the average value of each set of observed backbone (1Hα, 13Cα, 13Cβ, 13CO, 15N and 1HN) shifts and their corresponding predicted chemical shifts. The difference between these two averages results in a nucleus-specific chemical shift offset or reference correction (i.e. one for 1H, one for 13C and one for 15N). In order to ensure that certain extreme outliers do not unduly bias these average offset values, the average of the observed shifts is only calculated after excluding potential mis-assignments or typographical errors [2].

SHIFTCOR output:

File:SHIFTCOR fig1.png
Figure 1. An example SHIFTCOR output for bmr4375.str

SHIFTCOR generates and reports chemical shift offsets or differences for each nucleus. The results contain the chemical shift analyses (including lists of potential mis-assignments, the estimated referencing errors, the estimated error in the calculated reference offset (95% confidence interval), the applied or suggested reference offset, correlation coefficients, RMSD values) and the corrected BMRB formatted chemical shift file (see Figure 1 for details) [2].

SHIFTCOR uses the chemical shift calculation program SHIFTX [12] to predict 1Hα, 13Cα,15N shifts based on the 3D structure coordinates of the protein being analyzed. By comparing the predicted shifts to the observed shifts, SHIFTCOR is able to accurately identify chemical shift reference offsets as well as potential mis-assignments. A key limitation to the SHIFTCOR approach is that requires that the 3D structure for the target protein be available to assess the chemical shift reference offsets. Given that chemical shift assignments are typically made before the structure is determined, it was soon realized that structure-independent approaches were required to develop [5].

Structure-independent chemical shift correction programs

Several methods have been developed that make use of the estimated (via 1H or 13C shifts) or predicted (via sequence) secondary structure content of the protein being analyzed. These programs include PSSI [10], CheckShift [6], [7], LACS [9], [4] and PANAV [11]. Both PANAV (http://www.wishartlab.com/web_servers/panav) and CheckShift (http://checkshift.services.came.sbg.ac.at/) are also available as web servers.

The PSSI and PANAV programs use the secondary structure determined by 1H shifts (which are almost never mis-referenced) to adjust the target protein’s 13C and 15N shifts to match the 1H-derived secondary structure. LACS uses the difference between secondary 13Cα and 13Cβ shifts plotted against secondary 13Cα shifts or secondary 13Cβ shifts to determine reference offsets. A more recent version of LACS [68] has been adapted to identify 15N chemical shift mis-referencing. This new version of LACS exploits the well-known relationship between 15N shifts and the 13Cα (or 13Cβ shifts of the preceding residue [3]. In contrast to LACS and PANAV/PSSI, CheckShift uses secondary structure predicted from high-performance secondary structure prediction programs such as PSIPRED [13] to iteratively adjust 13C and 15N chemical shifts so that their secondary shifts match the predicted secondary structure. These programs have all been shown to accurately identify mis-referenced and properly re-reference protein chemical shifts deposited in the BMRB [7], [11]. Note that both LACS and CheckShift are programmed to always predict the same offset for 13Cα and 13Cβ shifts, whereas PSSI and PANAV do not make this assumption. As a general rule, PANAV and PSSI typically exhibit a smaller spread (or standard deviation) in calculated reference offsets, indicating that these programs are slightly more precise than either LACS or CheckShift. Interestingly, neither LACS nor CheckShift are able to handle proteins that have the extremely large (above 40 ppm) reference offsets, whereas PANAV and PSSI seem to be able to deal with these kinds of anomalous proteins [11].

In a recent study [11], a chemical shift re-referencing program (PANAV) was run on a total of 2421 BMRB entries that had a sufficient proportion of (>80%) of assigned chemical shifts to perform a robust chemical shift reference correction. A total of 243 entries were found with 13Cα shifts offset by more than 1.0 ppm, 238 entries with 13Cβ shifts offset of more than 1.0 ppm, 200 entries with 13C’ shifts offset of more than 1.0 ppm and 137 entries with 15N shifts offset by more than 1.5 ppm. From this study, 19.7% of the entries in the BMRB appear to be mis-referenced. Evidently, chemical shift referencing continues to be a significant, and as yet unresolved problem for the biomolecular NMR community [5][11].

See also

External Links

References

  1. ^ Wishart, DS (1995). "1H, 13C and 15N chemical shift referencing in biomolecular NMR". Journal of Biomolecular NMR. 6: 135-40. PMID 8589602. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  2. ^ a b c d e Zhang, H (Mar 2003). "RefDB: A database of uniformly referenced protein chemical shifts". J. Biomol. NMR. 25: 173-195. doi:10.1023/A:1022836027055. PMID 12652131. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  3. ^ a b c Wishart, DS (2001). "Use of chemical shifts in macromolecular structure determination". Methods in Enzymology. 338: 3-34. PMID 11460554. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  4. ^ a b c Wang, L (2009). "Empirical correlation between protein backbone 15N and 13C secondary chemical shifts and its application to nitrogen chemical shift re-referencing". Journal of Biomolecular NMR. 44: 95-99. PMID 19436955. Cite error: The named reference "pmid19436955" was defined multiple times with different content (see the help page).
  5. ^ a b c d Wishart, DS (Feb 2011). "Interpreting protein chemical shift data". Progress in Nuclear Magnetic Resonance Spectroscopy. 85: 62-87. PMID 21241884.
  6. ^ a b Ginzinger, SW (2007). "CheckShift: automatic correction of inconsistent chemical shift referencing". Journal of Biomolecular NMR. 39: 223-227. PMID 17899394. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  7. ^ a b c Ginzinger, SW (2009). "CheckShift improved: fast chemical shift reference correction with high accuracy". Journal of Biomolecular NMR. 44: 207-211. PMID 19575298. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  8. ^ Moseley, NH (Jul 2004). "Assignment validation software suite for the evaluation and presentation of the protein resonance assignment data". Journal of Biomolecular NMR. 28: 341-355. PMID 14872126. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  9. ^ a b Wang, L (May 2005). "Linear analysis of carbon-13 chemical shift differences and its application to the detection and correction of errors in referencing and spin system identifications". Journal of Biomolecular NMR. 32: 13-22. PMID 16041479. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  10. ^ a b Wang, Y (2005). "A simple method to adjust inconsistently referenced 13C and 15N chemical shift assignments of proteins". Journal of Biomolecular NMR. 31: 143-148. PMID 15772753. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  11. ^ a b c d e f Wang, B (2010). "A probabilistic approach for validating protein NMR chemical shift assignments". Journal of Biomolecular NMR. 47: 85-99. PMID 20446018. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  12. ^ Neal, S (Jul 2003). "Rapid and accurate calculation of protein 1H, 13C and 15N chemical shifts". Journal of Biomolecular NMR. 26: 215-240. doi:10.1023/A:1023812930288. PMID 12766419. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  13. ^ McGuffin, LJ (2000). "The PSIPRED protein structure prediction server". Bioinformatics. 16: 404–405. PMID 10869041. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)

General References

  • Wishart, DS (1992). "The chemical shift index - a fast and simple method for the assignment of protein secondary structure through NMR spectroscopy". Biochemistry. 31: 1647–1651. PMID 1737021. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  • Wishart, DS (1994). "The C-13 chemical-shift index - a simple method for the identification of protein secondary structure using C-13 chemical-shift data". Journal of Biomolecular NMR. 4: 171–180. PMID 8019132. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  • Osapay, K (1994). "Analysis of proton chemical shifts in regular secondary structure of proteins". Journal of Biomolecular NMR. 4: 215-230. PMID 8019135. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  • Gronenborn, AM (1994). "Identification of N-terminal helix capping boxes by means of 13C chemical shifts". Journal of Biomolecular NMR. 4: 455-458. PMID 8019146. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  • Moseley, HN (2004). "Assignment validation software suite for the evaluation and presentation of protein resonance assignment data". Journal of Biomolecular NMR. 28: 341-355. PMID 14872126. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)