XEASY Backbone Assignment: Difference between revisions

From NESG Wiki
Jump to navigation Jump to search
No edit summary
(Blanked the page)
 
(10 intermediate revisions by one other user not shown)
Line 1: Line 1:
== '''Backbone Assignment with XEASY/UBNMR''' ==


Sequential backbone and 13CB resonance assignment is associated with mapping of SRDs identified in spin system identification onto the polypeptide sequence. This is accomplished using two (4,3)D GFT NMR experiments, that is, HNNCABCA and CABCA(CO)NHN, or using two non-GFT experiments HNNCACB/CACBCONHN.
=== '''Analysis of the (4,3)D GFT HNNCABCA and CABCA(CO)NHN spectra'''  ===
HNNCABCA comprises of peaks representing both intra-residue and sequential connectivities (as in HNNCACB). These can be used to sort SRDs in sequential order, and to then assign them to specific residues in the primary structure. Since the intra-residue connectivities are often comparably weak, this experiment is routinely combined with CABCA(CO)NHN (which comprises, as CBCA(CO)NHN, sequential connectivities only).
<br>
#'''Peak Picking''' <br>
##Go to <tt>analysis/xeasy/backbone</tt>. Edit macro <tt>getfil</tt> to import backbone spectra, sequence, prot and peaklists. Edit macro [[Media:XEASY_makeCabcaPeak.txt|makeCabcaPeak]] with the sequence and prot file names. In UBNMR, run macro <tt>makeCabcaPeak</tt> to generate an extended GFT AtomList for backbone assignment which contains linear combinations of 13CA and 13CB shifts for each residue and SRD; to generate a starting peaklist for analysis of HNNCABCA and CABCA(CO)NHN by using the 15N/1HN backbone shifts of SRD-I. Intraresidue 13C shifts are assigned to '''SRD-I''' numbers, while 13C shifts of the residue preceding '''SRD-I''' are assigned to '''SRD-II''' numbers. This results in a single CABCA-peak list for the four sub-spectra of the two GFT NMR experiments. Peaks are colored according to sub-spectrum and intra- or sequential connectivity. This procedure allows one to efficiently handle sequential connectivities and ensure efficient book-keeping during the assignment process. <br>
##In XEASY, use <tt>ns</tt>, <tt>ls</tt>, <tt>lc</tt>, and <tt>lp</tt> to load HNNCABCA and CABCA(CO)NHN spectra and the SequenceList, ProtonList and PeakList; use <tt>se</tt>, <tt>gs</tt>, <tt>fs</tt> and <tt>bs</tt> to sort and display strips (Figure 1A); use <tt>mr</tt> to identify and move peaks (Figure 1B); start with CABCA(CO)NHN sub-spectra and continue with HNNCABCA sub-spectra (remove unobserved peaks); use <tt>ra</tt> regularly to check on the quality of the PeakList; <tt>ac</tt>, <tt>wc</tt> and <tt>wp</tt> to save updated lists. <br> <br> '''Figure 1: Peak picking of (4,3)D GFT HNNCABCA and CABCA(CO)NHN spectra.''' <br> '''A: Before peak position adjustment by <tt>mr</tt>;''' <br> [[File:XEASY_backbone1.jpg]] <br> <br> '''B: After peak position adjustment by <tt>mr</tt>.''' <br> [[File:XEASY_backbone2.jpg]] <br> <br>
#'''Initial Backbone Assignment from AutoAssign''' <br>
##Go to <tt>/analysis/xeasy/backbone/autos</tt>, read file [[Media:XEASY autos README.txt|autos_README]] for instructions on how to edit the macro [[Media:XEASY makeAutoList.txt|makeAutoList]] and file <tt>myprot.aat</tt>. In UBNMR run <tt>makeAutoList</tt> to generate input files for AutoAssign. Erros at this step can be corrected by looking closely at the peaks reported, moving them again if needed, and running UBNMR makeAutoList again, until all erros are eliminated. The peak ID of one simulated AutoAssign input file ('''myprot-hsqc.pks''') correspond to the residue number of '''SRD-I''', which is required for this protocol. <br> '''Since HNNCABCA and CABCA(CO)NHN spectra provide 4D information, it is suggested that generate the 4D peak list for AutoAssign in order to take full advantage of GFT spectra and get a better assignment results.'''
***Click for the AutoAssign control file example [[Media:XEASY myprot.aat|myprot.aat]].
***Peak pattern used in the AutoAssign input file of 3D CACB type experiments: HN(i), N(i), CA(i or i-1) or CB(i or i-1);
***Peak pattern used in the AutoAssign input file of 4D CACB type experiments: HN(i), N(i), CA(i), CA(i) or CB(i); and HN(i), N(i), CA(i-1), CA(i-1) or CB(i-1).<br>
##Run AutoAssign several times with varying matching tolerances for CA and CB in the myprot.aat file (0.1-0.6ppm); save AutoAssign output files;<br>
***It is suggested to use <tt>Default Execution</tt> method to run the program.
***From the main menu of AutoAssign, follow option <tt>Examine</tt> &gt; <tt>All GSs</tt> to write output file that contains both AutoAssign assignment and the corresponding '''SRD-I''' residue numbers. <br>
##In UBNMR, run macro [[Media:XEASY AA2Xeasy.txt|AA2Xeasy]] to use the best or consensus AutoAssign output file to complement the SRD-I and SRD-II entries of the XEASY SequenceList with <tt>mapping numbers</tt>. <br> <br>
<br> Modified macro for AA2XEASY
<pre>
#UBNMR script for converting autoassign results to xeasy format sequence
init
read seq ../nhsqc.seq
<br>
#update autoassign results to xeasy sequence
update mapping aa.out 200
#200 is the number difference between self and sequencial SRD's.
#write updated sequence
write sequence aa.seq
</pre>
<br>
#'''Confirming Backbone Assignment from AutoAssign in XEASY''' <br> Here two XEASY sessions are recommended for efficiency reasons, one is HNNCABCA/CABCACONHN analysis, the other is 15N NOESY analysis. For each [w1(13CA;13CAB),w3(1HN)]-strip corresponding to the 15N/1HN shifts of a given SRD-1, one expects to observe up to four [two] peaks in the two sub-spectra of (4,3)D HNNCABCA[CABCA(CO)NHN]. In the following, sequential ordering of these strips in XEASY is described. This leads to "sequential walks" in the two sub-spectra along the polypeptide chain. Sequential connectivities are confirmed in 15N-resolved NOESY '''Also see below the use Of 15N-resolved [1H,1H] NOESY Spectrum).''' <br>
##In UBNMR, run macro [[Media:XEASY makeBbNNoesy.txt|makeBbNNoesy]] to use the 15N /1HN backbone shifts of SRD-I to generate a starting peaklist that only contains diagonal peaks for analysis of the 15N-resolved part of simultaneous 3D 15N/13Caliphatic/13Caromatic-resolved [1H,1H]-NOESY.
##In XEASY session I, <tt>ns</tt> to load the four sub-spectra of (4,3)D HNNCABCA/CABCACONHN; use <tt>ls</tt> to load the SequenceList that contains AutoAssign results; <tt>lc</tt> to load the AtomList; use <tt>lp</tt> to load the CABCA-PeakList.
##In XEASY session II, use <tt>ns</tt> to load 15N-resolved NOESY; use <tt>ls</tt> to load the SequenceList that contains AutoAssign results; use <tt>lp</tt> to load the corresponding starting peak list. There is no need to move any peaks at this time.
##In XEASY session I and II, use <tt>sn</tt> to swap fragment number and mapping; use <tt>se</tt> to sort strips; use <tt>ls</tt> and <tt>lc</tt> to load the SequenceList (with AutoAssign assignment) and the atomlist; use <tt>gs</tt> to display the strips. Now the strips are sorted in the following order: Assigned SRD residues in sequential order followed by unassigned SRD residues in SRD -1 sequential order.
##In XEASY session I and II, check and confirm sequential connectivities for the assigned SRD residues. Use <tt>ed</tt> to modify the mapping number if the assignment is wrong.
##Continue to complete the backbone assignment manually as described below<br> <br>
#'''Complete Backbone Assignment by perform sequential ordering of SRDs''' <br> Here two XEASY sessions are recommended for efficiency reasons, one is HNNCABCA/CABCACONHN analysis, the other is 15N NOESY analysis. For each [w1(13CA;13CAB),w3(1HN)]-strip corresponding to the 15N/1HN shifts of a given SRD-1, one expects to observe up to four [two] peaks in the two sub-spectra of (4,3)D HNNCABCA[CABCA(CO)NHN]. In the following, sequential ordering of these strips in XEASY is described. This leads to "sequential walks" in the two sub-spectra along the polypeptide chain. Sequential connectivities are confirmed in 15N-resolved NOESY.
##In XEASY session I, use <tt>es</tt>, <tt>se</tt>, <tt>gs</tt> to select two strips exhibiting four well resolved peaks arising from a Ser, Thr, Ala residue in (4,3)D HNNCABCA or terminal residues of an assigned segment assigned from AutoAssign as a starting point; use <tt>sh</tt> to put those strips "on hold".
##In XEASY session I, use <tt>rd</tt> and <tt>pc</tt> to search for sequential neigbours.
##In XEASY session I, using <tt>fc</tt> and <tt>bc</tt> to inspect the sorted strips in order to identify the strip containing the sequential neighbor and use <tt>sh</tt> to put "on hold".
##In XEASY session II, use <tt>cd</tt> / <tt>cc</tt> to confirm sequential connectivities in 15N-resolved NOESY.
##In XEASY, repeat steps 1 to 4 until you have identified a maximal set of strips you can map to the polypeptide sequence.
##In XEASY, use <tt>ed</tt> to edit peak entries, type in "mapping numbers" which link the SRD-I and SRD-II numbers to residue numbers.
##In XEASY, repeat 5 and 6 until analysis is complete
##In XEASY, use <tt>aa</tt>, <tt>ac</tt>, <tt>ws</tt>, <tt>wc</tt> and <tt>wp</tt> to save all XEASY files before switching from SRD to sequential amino acid residue numbering using <tt>sn</tt>. This yields modified SequenceList, AtomList and PeakList which should be saved, and re-loaded and saved a second time.
##In UBNMR, run <tt>cleanBbGftProt</tt> to make a clean AtomList and SeqList by deleting extra atoms and SRDs, fixing nomenclature, and updating single-quantum 13CA and 13CA shifts.
<pre>
#UBNMR macro cleanBbGftProt
init
read seq nhsqc.seq
read prot bbgft-swapped.prot append
update atom GFTatom CA
update atom GFTatom CB
remove SRDs
write seq final-clean.seq
remove GFTatoms
write prot final-clean.prot
</pre>
=== '''Analysis of the(4,2)D GFTHNNCABCA and CABCA(CO)NHN Spectra''' ===
In case of NMR data were collected with (4,2)D GFT HNNCABCA and CABCA(CO)NHN for backbone assignment, one can treat them the same as previously described (4,3)D GFT HNNCABCA/CABCA(CO)NHN analysis. Since the protocol of analysis of the (4,3)D GFT HNNCABCA and CABCA(CO)NHN spectra is based on analysis the stips residue by residue, it can be completely adapted to the analysis of the(4,2)D GFTHNNCABCA and CABCA(CO)NHN Spectra. Simply do the same things as described above: <br/>
# Simulating the intial peak lists;
# Adjusting peak position in strips of all residues with XEASY;
# Calculating single quantum chemical shifts from GFT shift linear combinations
# Simulating AutoAssign input files and do the initially backbone assignment by AutoAssign;
# Maually checking results from AutoAssign and completing the assignment.<br/>
=== '''Analysis of the HNNCACB/CACB(CO)NHN''' ===
In case of NMR data were collected with  non-GFT NMR experiments such as HNNCACB/CACß(CO)NHN for backbone assignment, one can treat HNNCACB/CACB(CO)NHN the same as previously described (4,3)GFT HNNCABCA/CABCA(CO)NHN analysis. Simply do the same things as described in analysis of the (4,3)D GFT HNNCABCA and CABCA(CO)NHN spectra: <br/>
# Simulating the intial peak lists;
# Adjusting peak position in strips of all residues with XEASY;
# Simulating AutoAssign input files and do the initially backbone assignment by AutoAssign;
# Maually checking results from AutoAssign and completing the assignment.<br/>
=== '''Use Of 15N-resolved [1H,1H] NOESY Spectrum''' ===
Sequential NOEs observed in 15N-resolved [1H,1H] NOESY Spectrum can be used to confirm and facilitate backbone assignment. A convinient way is described here on how to do the sequential assignment by using the 15N-resolved [1H,1H] NOESY spectrum. Please check section '''Confirming Backbone Assignment from AutoAssign in XEASY''', [*Complete Backbone Assignment by perform sequential ordering of SRDs* and [[NESG:Side Chain Assignment|Side Chain Assignment]] for addtional and other informations.
# In UBNMR, run <tt>makeBbNNoesy</tt> to use the 15N /1HN backbone shifts of SRD-I to generate a starting peaklist that only contains diagonal peaks for analysis of the 15N-resolved part of simultaneous 3D 15N/13Caliphatic/13Caromatic-resolved [1H,1H]-NOESY.
# In XEASY, use <tt>ns</tt> to load 15N-resolved NOESY; use <tt>ls</tt> to load the SequenceList; use <tt>lp</tt> to load the corresponding starting peak list. There is no need to move any peaks at this time.
# In XEASY, use <tt>cd</tt> and <tt>cc</tt> to find sequential connectivities in 15N-resolved NOESY.
## After generating a strip list, type <tt>cd</tt> to calculated the the correlation coefficient between every two strips. <tt>he [cd</tt> command produces a pop-up window with the default values shown below. <br/> '''Figure 3. The pop-up window after command <tt>cd</tt>.''' <br />    <img src="%ATTACHURLPATH%/cd.jpg" alt="cd.jpg" width='390' height='282' /> <br /> <br /> Please see the Xeasy manual for specifics regarding these parameters.  The default parameters are sufficient for 15N resolved [1H, 1H]-NOESY.  For searching other spectra (triple resonance, etc.) the "Factor to enhance peaks" and "Factor to reduce peaks" should both be set to "1.0 1.0".  The choices for inter- and intra-spectral searches and "use absolute intensities" should also be considered.
## Use <tt>gs</tt>, <tt>sf</tt>, <tt>sf</tt> or <tt>bf</tt> to display the intersted strip on the screen.
## Typing <tt>cc</tt>, then click on any position in the strip interested.  Xeasy will then sort all strips according to the correlation coifficients of these strips to the clicked strips. The strips sequential residues often give the highest correlation coifficients and can be easily identified in this way. The identified sequential residues need to be confirmed from the triple resonance spectra.
=== '''Assignment Validation''' ===
Once the assignment results were obtained from, it is required to validate the assignments.  AutoAssign provides Assignment Validation Software suite (AVS) to aid the user in validating assignments.  Please check the [http://www-nmr.cabm.rutgers.edu/NMRsoftware/autoassign/Documentation/Help/tutorial.html AutoAssign Tutorial] for detailed usage. The four main tools in AVS are:
* '''missing_shifts.pl''' - shows which shifts in a BMRB file are missing and statistics on assignment completeness. <br/> Usage: missing_shifts.pl [ options ] input_bmrb_file <br/>
* '''validate_assignments.pl''' - performs a statistical evaluation of assigned chemical shifts in a BMRB file. <br/> Usage: validate_assignments.pl [options] input_bmrb_file <br/>
* '''typing_degeneracy.pl''' - performs a statistical evaluation on the uniqueness of the protein sequence and the uniqueness of mapped segments of spin systems. <br/> Usage:  typing_degeneracy.pl [options] <br/>
* '''CMap Image Editor''' - generates graphical representations of a protein's resonance assignments and all data supporting the assignments. Please check [[NESG:http://www-nmr.cabm.rutgers.edu/NMRsoftware/autoassign/Documentation/Help/cmap_image_editor.html|http://www-nmr.cabm.rutgers.edu/NMRsoftware/autoassign/Documentation/Help/cmap_image_editor.html]] for more detail.
%COMMENT%
<span class="commentPlugin commentPluginOutputOneliner">  * Gaohua, where can i find the makeBbNNoesy script for UBNMR? it is not on the directory tree, or in the scripts directory. Are there other scripts here that are not being used anymore? thanks! -- Main.AnaZeri - 29 Aug 2007 - 09:45</span></span><!--/commentPlugin-->
<span class="commentPlugin commentPluginOutputOneliner">  * Gaohua, the script A2NeSGC.XEASY is not working, and the guys here at Buffalo use a different one, updatemapping. what shall we do about this? Thanks, Ana -- Main.AnaZeri - 28 Aug 2007 - 11:41</span></span><!--/commentPlugin-->
-- Main.GaohuaLiu - 04 Feb 2007
files for download:
* [[NESG:%ATTACHURL%/makeCabcaPeak|makeCabcaPeak]]: makeCabcaPeak
* [[NESG:%ATTACHURL%/makeBbNNoesy|makeBbNNoesy]]: makeBbNNoesy
* [[NESG:%ATTACHURL%/AA2Xeasy|AA2Xeasy]]: AA2Xeasy
* [[NESG:%ATTACHURL%/makeAutoList|makeAutoList]]: makeAutoList
* [[NESG:%ATTACHURL%/myprot.aat|myprot.aat]]: myprot.aat
* [[NESG:%ATTACHURL%/autos_README|autos_README]]: autos_README

Latest revision as of 20:18, 24 November 2009