Structure Calculation Using CS-DP ROSETTA: Difference between revisions
No edit summary |
No edit summary |
||
Line 5: | Line 5: | ||
<br> | <br> | ||
== ''' | == '''Protocols''' == | ||
=== Prerequisites === | === Prerequisites === | ||
Line 13: | Line 13: | ||
**Side chain assignment: manual | **Side chain assignment: manual | ||
*3D <sup>13</sup>C- and <sup>15</sup>N-edited NOESY spectra<br> | *3D <sup>13</sup>C- and <sup>15</sup>N-edited NOESY spectra<br> | ||
=== Peak picking procedures === | |||
Generating raw NOESY peaks lists by automatic peak picking: | |||
*The <sup>13</sup>C and <sup>15</sup>N-edited 3D-NOESY raw peak lists are prepared systematically using the program SPARKY by automatic peak picking the 3D spectrum using the 2D <sup>1</sup>H,<sup>13</sup>C and <sup>1</sup>H,<sup>15</sup>N-HSQC as root spectra and 0.02 and 0.2 ppm as the pick-picking tolerances in the indirect <sup>1</sup>H and heavy atom dimensions, respectively. | |||
Generating refined NOESY peaks lists: | |||
*Refined NOESY peak lists are obtained by expert manual editing which involves artifact removal, picking of overlapped peaks and(or) picking starting from a script-based 3D-HcCH intra/sequential-residue NOE list that was manually extended to the long range unpicked resonances. | |||
=== CS-DP-ROSETTA procedure<br> === | |||
'''Input for CS-DP-Rosetta:''' | |||
*Protein sequence file (fasta format) | |||
*chemical shift file (BMRB2.1 format) | |||
*unassigned raw or refined NOESY peak lists files (Xeasy format or Sparky format).<br> | |||
'''Step 1: Generating ~50000 CS-Rosetta decoys'''<br><br>I. Generate 3-mer & 9-mer fragments library based on sequence and chemical shift information by Rosetta Fragments server or Rosetta software suite<br>II. Use CS-Rosetta protocol to generate ~50,000 decoys, keep 1,000 decoys with lowest Rosetta energy<br><br>'''Step 2: Filter decoys by a linear combination of Rosetta energy and DP-Score'''<br> <br>I. Prepare sequence file and chemical shift file in BMRB format, and peak list files can be Sparky, Xeasy or any other table format. Use AutoStructure 2.2.1 GUI to make control file for RPF calculation. By default, tolerance for <sup>13</sup>C and <sup>15</sup>N is set to 0.5, and tolerance for the <sup>1</sup>H dimensions is set to 0.05.<br>Make sure sequence file, chemical shift file and all the peak list files in the project directory <br>(1) $Autostructure_install_path/bin/asgui.<br>(2) Select File->New Control File, a “Control File Display” window would appear.<br>(3) In the “General Section” tab, input “Protein Name”, select “Sequence File”, “Chemical Shift File”, set “Iterative Analysis Cycles” to 1, then select tab “PeakList Section”, add peak list files one by one. Save the control file after all the peak lists have been added. <br> <br>II. Calculate DP-Score for the 1000 decoys with lowest Rosetta energy using RPF module of AutoStructure 2.2.1. <sub></sub><br>Command: | |||
<pre>$Autostructure_install_path/bin/autostructure -c control_file –o rpf_output_path -q path_of_query_structure -s | |||
</pre> | |||
Extract DP-score from rpf_output_path/*.ovw file <br><br>III. Calculate DP-Score for the 1,000 decoys with lowest Rosetta energy using RPF module of AutoStructure 2.2.1 <br>IV. Calculate target function for each decoy: | |||
<pre>t<sub>i</sub> = CS-Rosetta all-atom energy<sub>i </sub>+ 1000*(1 – DP-score<sub>i</sub>) | |||
</pre> | |||
<br>V. Rank 1000 decoys based on t<sub>i</sub> , keep the first 20 lowest t<sub>i</sub> decoys for further model rebuild-and-refinement<br><br>'''Step 3: Model rebuilding and refinement for 20 lowest t<sub>i</sub> decoys'''<br> <br>I. Identifying flexible regions which have largest C-alpha deviations within the 20 lowest t<sub>i</sub> decoys <br>II. Flexible regions are stochastically rebuilt by fragment insertion and CCD loop closure <br>III. Using physically realistic Rosetta force field to perform all-atom refinement for the whole structure<br>IV. The best 10 models with the lowest t<sub>i</sub> are saved as the final CS-DP-Rosetta models<br><br><br><br> | |||
== '''References''' == | == '''References''' == | ||
1. Raman, S., Huang, Y. J., Mao, B., Rossi, P., Aramini, J. M., Liu, G., Montelione, G. T., and Baker, D. (2010) Accurate automated protein NMR structure determination using unassigned NOESY data. <span style="font-style: italic;">J</span>''. Am. Chem. Soc.'', in press. | 1. Raman, S., Huang, Y. J., Mao, B., Rossi, P., Aramini, J. M., Liu, G., Montelione, G. T., and Baker, D. (2010) Accurate automated protein NMR structure determination using unassigned NOESY data. <span style="font-style: italic;">J</span>''. Am. Chem. Soc.'', in press. | ||
-- JanetHuang/JimAramini - Nov 2009 |
Revision as of 19:26, 27 November 2009
Introduction
The CS-DP-Rosetta approach (Ref. 1) merges the ideas of model generation using CS-Rosetta with model filtering by agreement to NOESY data via the DP-score from the RPF program to generate high accuracy protein structures. This hybrid approach uses both local backbone chemical shift data (CS-Rosetta) and unassigned NOESY data (DP-filtering) to direct Rosetta trajectories toward the native structure, producing more accurate models than CS-Rosetta alone. Given a raw (or refined) NOESY peak list and chemical shift (backbone and extensive sidechain) information, the DP-Score is used as a filter to effectively guide the trajectory of CS-Rosetta decoy generation, significantly reducing the search space. Since the NOESY peak list data are not directly included in structure calculation, CS-DP-Rosetta is much more robust with respect to the quality of these peak lists compared to methods which attempt to assign each NOESY peak to one or more specific interproton interactions.
Protocols
Prerequisites
- Complete 1H, 13C, and 15N resonance assignments using either conventional triple resonance or GFT approaches.
- Backbone assignment: using either AutoAssign or PINE.
- Side chain assignment: manual
- 3D 13C- and 15N-edited NOESY spectra
Peak picking procedures
Generating raw NOESY peaks lists by automatic peak picking:
- The 13C and 15N-edited 3D-NOESY raw peak lists are prepared systematically using the program SPARKY by automatic peak picking the 3D spectrum using the 2D 1H,13C and 1H,15N-HSQC as root spectra and 0.02 and 0.2 ppm as the pick-picking tolerances in the indirect 1H and heavy atom dimensions, respectively.
Generating refined NOESY peaks lists:
- Refined NOESY peak lists are obtained by expert manual editing which involves artifact removal, picking of overlapped peaks and(or) picking starting from a script-based 3D-HcCH intra/sequential-residue NOE list that was manually extended to the long range unpicked resonances.
CS-DP-ROSETTA procedure
Input for CS-DP-Rosetta:
- Protein sequence file (fasta format)
- chemical shift file (BMRB2.1 format)
- unassigned raw or refined NOESY peak lists files (Xeasy format or Sparky format).
Step 1: Generating ~50000 CS-Rosetta decoys
I. Generate 3-mer & 9-mer fragments library based on sequence and chemical shift information by Rosetta Fragments server or Rosetta software suite
II. Use CS-Rosetta protocol to generate ~50,000 decoys, keep 1,000 decoys with lowest Rosetta energy
Step 2: Filter decoys by a linear combination of Rosetta energy and DP-Score
I. Prepare sequence file and chemical shift file in BMRB format, and peak list files can be Sparky, Xeasy or any other table format. Use AutoStructure 2.2.1 GUI to make control file for RPF calculation. By default, tolerance for 13C and 15N is set to 0.5, and tolerance for the 1H dimensions is set to 0.05.
Make sure sequence file, chemical shift file and all the peak list files in the project directory
(1) $Autostructure_install_path/bin/asgui.
(2) Select File->New Control File, a “Control File Display” window would appear.
(3) In the “General Section” tab, input “Protein Name”, select “Sequence File”, “Chemical Shift File”, set “Iterative Analysis Cycles” to 1, then select tab “PeakList Section”, add peak list files one by one. Save the control file after all the peak lists have been added.
II. Calculate DP-Score for the 1000 decoys with lowest Rosetta energy using RPF module of AutoStructure 2.2.1.
Command:
$Autostructure_install_path/bin/autostructure -c control_file –o rpf_output_path -q path_of_query_structure -s
Extract DP-score from rpf_output_path/*.ovw file
III. Calculate DP-Score for the 1,000 decoys with lowest Rosetta energy using RPF module of AutoStructure 2.2.1
IV. Calculate target function for each decoy:
t<sub>i</sub> = CS-Rosetta all-atom energy<sub>i </sub>+ 1000*(1 – DP-score<sub>i</sub>)
V. Rank 1000 decoys based on ti , keep the first 20 lowest ti decoys for further model rebuild-and-refinement
Step 3: Model rebuilding and refinement for 20 lowest ti decoys
I. Identifying flexible regions which have largest C-alpha deviations within the 20 lowest ti decoys
II. Flexible regions are stochastically rebuilt by fragment insertion and CCD loop closure
III. Using physically realistic Rosetta force field to perform all-atom refinement for the whole structure
IV. The best 10 models with the lowest ti are saved as the final CS-DP-Rosetta models
References
1. Raman, S., Huang, Y. J., Mao, B., Rossi, P., Aramini, J. M., Liu, G., Montelione, G. T., and Baker, D. (2010) Accurate automated protein NMR structure determination using unassigned NOESY data. J. Am. Chem. Soc., in press.
-- JanetHuang/JimAramini - Nov 2009