Resonance Assignment/AutoAssign: Difference between revisions

From NESG Wiki
Jump to navigation Jump to search
No edit summary
No edit summary
 
(9 intermediate revisions by 2 users not shown)
Line 1: Line 1:
== '''Introduction'''  ==
== '''Introduction'''  ==


[http://www-nmr.cabm.rutgers.edu/NMRsoftware/nmr_software.html AutoAssign] is a constraint-based expert system for automating the assignment and analysis of backbone NMR resonance assignments of proteins (Ref. 1).  The program is implemented in C++, Java2, and Perl programming languages and supported on  SGI-IRIX, Sun-Solaris, MAC-OSX, x86-Linux, and x86_64-Linux architectures. The newest AutoAssign distribution (version 2.4.0) automates the assignments of HN, NH, CO, CA, CB, HA, and HB resonances in non-, partially-, and fully-deuterated samples.  The rich graphical user interface (GUI) provides a many sets of tools for dataset conversions, assignment validations, and various graphical displays of assignment results.  AutoAssign is well tested on a large number of independently-collected triple-resonance NMR data sets of proteins ranging in size from ~6 to ~32 kD, including one fully-deuterated protein and and a dataset with reduced-dimensionality experiments.  AutoAssign performs the automated analysis of assignments in only seconds on current RISC and x86 platforms.  
[http://www-nmr.cabm.rutgers.edu/NMRsoftware/nmr_software.html AutoAssign] is a constraint-based expert system for automating the assignment and analysis of backbone NMR resonance assignments of proteins (Ref. 1,2).  The program is implemented in C++, Java2, and Perl programming languages and supported on  SGI-IRIX, Sun-Solaris, MAC-OSX, x86-Linux, and x86_64-Linux architectures. The newest AutoAssign distribution (version 2.4.0) automates the assignments of HN, NH, CO, CA, CB, HA, and HB resonances in non-, partially-, and fully-deuterated samples.  The rich graphical user interface (GUI) provides a many sets of tools for dataset conversions, assignment validations, and various graphical displays of assignment results.  AutoAssign is well tested on a large number of independently-collected triple-resonance NMR data sets of proteins ranging in size from ~6 to ~32 kD, including one fully-deuterated protein and and a dataset with reduced-dimensionality experiments.  AutoAssign performs the automated analysis of assignments in only seconds on current RISC and x86 platforms.  


The following description is mainly taken from the [http://www-nmr.cabm.rutgers.edu/NMRsoftware/autoassign/Documentation/Help/tutorial.html AutoAssign SOP (tutorial)].&nbsp; Please check this tutorial and the version 2.4.0 [http://www-nmr.cabm.rutgers.edu/NMRsoftware/autoassign/Documentation/index.html help documentation] for more information and usage. <br>  
The following description is mainly taken from the [http://www-nmr.cabm.rutgers.edu/NMRsoftware/autoassign/Documentation/Help/tutorial.html AutoAssign SOP (tutorial)].&nbsp; Please check this tutorial and the version 2.4.0 [http://www-nmr.cabm.rutgers.edu/NMRsoftware/autoassign/Documentation/index.html help documentation] for more information and usage. <br>  
Line 97: Line 97:
=== Manually Preparation of Peak Files (UB)<br>  ===
=== Manually Preparation of Peak Files (UB)<br>  ===


One can manually prepare input files by using the follow format, [[UBNMR|UBNMR]] can also be used to prepare the input files for AutoAssign. ''In [http://www.nsm.buffalo.edu/Chem/HTP_twiki43/bin/view/NESG/UBNMR UBNMR] run <code>makeAutoList</code> to generate input files for AutoAssign.''  
One can manually prepare input files by using the follow format, [[UBNMR|UBNMR]] can also be used to prepare the input files for AutoAssign. ''In [[UBNMR|UBNMR]] run <code>makeAutoList</code> to generate input files for AutoAssign.''  


<br>  
<br>  


*Peak File Format: <br> <pre>&lt;Index&gt;    &lt;dim1&gt;    &lt;dim2&gt;    [dimX]    &lt;Intensity&gt;    &lt;Label&gt;[.PeakNotes]
*Peak File Format:
</pre>
<pre> &lt;Index&gt;    &lt;dim1&gt;    &lt;dim2&gt;    [dimX]    &lt;Intensity&gt;    &lt;Label&gt;[.PeakNotes]
 
...
...  
*
 
</pre>
*
*Here is an example HNCA peak list file: <br>
 
<pre>   #Index  Xppm    Yppm    Zppm    Intensity    Label.notes
*Here is an example HNCA peak list file: <br> <pre>   #Index  Xppm    Yppm    Zppm    Intensity    Label.notes
  126    8.871      110.859  50.247  3242120        HNCA
</pre>
  125    8.870      110.898  62.529  724463        HNCA.;g45
 
73      8.744      116.161  56.614  2287600      HNCA
    126    8.871      110.859  50.247  3242120        HNCA
  &nbsp;:
    125    8.870      110.898  62.529  724463        HNCA.;g45
  &nbsp;:
    73      8.744      116.161  56.614  2287600      HNCA
    145     9.153   112.004     57.415   2788050       HNCA   </pre>  
  &nbsp;:
where individual fields are separated by spaces and/or tabs, and comments are indicated by a "#" sign at the beginning of a line.  
  &nbsp;:
 
145 9.153 112.004 57.415 2788050 HNCA <br> where individual fields are separated by spaces and/or tabs, and comments are indicated by a "#" sign at the beginning of a line.  


== '''Performing Resonance Assignments'''  ==
== '''Performing Resonance Assignments'''  ==
Line 124: Line 121:


*One can open a dataset to analyze start from the main menu by selecting <code>File+ -&gt; =Open Control File</code> menu option or pressing the <code>Open Control File</code> button on the 4th tab (dataset completion) of the Dataset Creation Graphical User Interface.  
*One can open a dataset to analyze start from the main menu by selecting <code>File+ -&gt; =Open Control File</code> menu option or pressing the <code>Open Control File</code> button on the 4th tab (dataset completion) of the Dataset Creation Graphical User Interface.  
*[http://www.nsm.buffalo.edu/Chem/HTP_twiki43/bin/view/NESG/AutoAssign AutoAssign] will bring up the following window prompting the user to connect to a server (Figure 8):<br>
*AutoAssign will bring up the following window prompting the user to connect to a server (Figure 8):<br>


==== Figure 8: Connection to Server  ====
==== Figure 8: Connection to Server  ====
Line 134: Line 131:
*Next a blank <code>Connectivity Map</code> window is opened (Figure 9).
*Next a blank <code>Connectivity Map</code> window is opened (Figure 9).


==== Figure 9:&nbsp; Connectivity Map ====
==== Figure 9:&nbsp; Connectivity Map ====


=== Starting a Backbone Resonance Assignment  ===
=== Starting a Backbone Resonance Assignment  ===
Line 147: Line 144:
[[Image:Autoassign10.jpg]]  
[[Image:Autoassign10.jpg]]  


<br>  
 
 
=== Assignment Constraints<br> ===
 
The user can specify a variety of assignment constraints in the Notes column of a Sparky peak list using a controlled vocabulary.&nbsp; These assignment constraints include: spin system typing, grouping and mapping constraints which are used by AutoAssign in the assignment process.&nbsp; A complete description of assignment constraints and their usage in AutoAssign is provided in the [http://www-nmr.cabm.rutgers.edu/NMRsoftware/autoassign/Documentation/Help/preparing_datasets.html#Assignment_Constraints online documentation].


=== '''Examining and Saving the Resonance Assignment Results'''  ===
=== '''Examining and Saving the Resonance Assignment Results'''  ===
Line 157: Line 158:
#There are several options under <code>Save</code> -&gt; <code>Chemical Shifts<span style="font-family: sans-serif;"> for saving the assignments in bmrb formats (2.1 and 3.0) and connectivity map format (cmap).</span></code><code>&lt;span style="font-family: sans-serif;" /&gt;</code><code>&lt;span style="font-family: sans-serif;" /&gt;</code><br><code></code>
#There are several options under <code>Save</code> -&gt; <code>Chemical Shifts<span style="font-family: sans-serif;"> for saving the assignments in bmrb formats (2.1 and 3.0) and connectivity map format (cmap).</span></code><code>&lt;span style="font-family: sans-serif;" /&gt;</code><code>&lt;span style="font-family: sans-serif;" /&gt;</code><br><code></code>


<br>
== '''The Connectivity Map Image Editor'''  ==
A connectivity map image represents resonance and other experimental data on the protein sequence in a graphical form.&nbsp; The CMap Image Editor (which can be launched under the Tools tab) allows one to create such an image (Figure 11).&nbsp; This facility is particularly useful for generating cmap images for publication.&nbsp; For a complete description of this editor and all of its functions see the [http://www-nmr.cabm.rutgers.edu/NMRsoftware/autoassign/Documentation/Help/cmap_image_editor.html on-line documentation].
==== Figure 11:&nbsp; A CMap Image  ====


[[Image:Cmap image editor.gif]]


== '''The Connectivity Map Image Editor''' ==
== '''AutoPeak Scripts''' ==


A connectivity map image represents resonance and other experimental data on the protein sequence in a graphical form.&nbsp; The CMap Image Editor allows one to create such an image (Figure 11).&nbsp; This facility is particularly useful for generating cmap images for publication.&nbsp; For a complete description of this editor and all of its functions see the [http://www-nmr.cabm.rutgers.edu/NMRsoftware/autoassign/Documentation/Help/cmap_image_editor.html on-line documentation].
The AutoAssign bin directory features a plethora of perl scripts written by Hunter Moseley for many functions related to AutoAssign and its interplay with various programs including Sparky, as well as interconversion between a variety of formats.&nbsp; A [http://www-nmr.cabm.rutgers.edu/NMRsoftware/autoassign/Documentation/Help/PerlScripts.html glossary] of these scripts is available and a more in-depth description of the more commonly used scripts in our structure determination process is available [http://www-nmr.cabm.rutgers.edu/NMRsoftware/autoassign/Documentation/Help/MajorUsesOfAutoPeak.html here].&nbsp; Included among the major applications of the AutoPeak scripts is the creation and manipulation of chemical shift files in BMRB&nbsp;format.&nbsp; Note that AutoPeak scripts in AutoAssign versions above 1.15.1 are written for BMRB&nbsp;3.0 format. Here is a list of the scripts we most often use in our structure determinations:


==== Figure 11:&nbsp; A CMap Image ====
*bmrb2cmap.pl - converts bmrb format to cmap format. bmrbACS2SparkyPks.pl - converts bmrb assigned chemical shift save frame to Assigned Sparky Peak List format.
*bmrb2AutoStructure.pl - converts bmrb format to AutoStructure Resonance List format.
*compare_bmrb.pl - compares two bmrb files and returns how well they match.
*missing_shifts.pl - reports the missing AtomTypes in a given BMRB file.
*sparkyPks2bmrb.pl - converts a sparky peaklist into a BMRB assigned peak list save frame.
*sparkyRL2bmrb.pl- converts Sparky resonance list into a BMRB file.


[[Image:Cmap_image_editor.gif]]
<br>


== <br> '''References'''<br>  ==
== '''References'''<br>  ==


1. &nbsp; Zimmerman, D.E., Kulikowski, C.A., Huang, Y., Feng, W., Tashiro, M.,&nbsp; Shimotakahara, S., Chien, C., Powers, R. and Montelione, G.T. (1997) Automated analysis of protein NMR assignments using methods from artificial intelligence. ''J. Mol. Biol. 269'', 592-610. <br>  
1. &nbsp; Zimmerman, D.E., Kulikowski, C.A., Huang, Y., Feng, W., Tashiro, M.,&nbsp; Shimotakahara, S., Chien, C., Powers, R. and Montelione, G.T. (1997) Automated analysis of protein NMR assignments using methods from artificial intelligence. ''J. Mol. Biol. 269'', 592-610. <br>  


2.&nbsp;&nbsp; Moseley, H.N.B., Monleon, D. and Montelione, G.T. (2001)&nbsp; Automatic Determination of Protein Backbone Resonance Assignments from Triple Resonance NMR Data. ''<span style="font-style: italic;">Methods</span>'' Enzymology 339'', 91-108.''


<br>


<br>


-- GaohuaLiu - 11 Jun 2007


-- GaohuaLiu - 11 Jun 2007
-- Updated/Edited by JimAramini - 09 Nov 2009


-- Updated/Edited by JimAramini - 09 Nov 2009
&nbsp;

Latest revision as of 17:59, 16 December 2010

Introduction

AutoAssign is a constraint-based expert system for automating the assignment and analysis of backbone NMR resonance assignments of proteins (Ref. 1,2).  The program is implemented in C++, Java2, and Perl programming languages and supported on  SGI-IRIX, Sun-Solaris, MAC-OSX, x86-Linux, and x86_64-Linux architectures. The newest AutoAssign distribution (version 2.4.0) automates the assignments of HN, NH, CO, CA, CB, HA, and HB resonances in non-, partially-, and fully-deuterated samples.  The rich graphical user interface (GUI) provides a many sets of tools for dataset conversions, assignment validations, and various graphical displays of assignment results.  AutoAssign is well tested on a large number of independently-collected triple-resonance NMR data sets of proteins ranging in size from ~6 to ~32 kD, including one fully-deuterated protein and and a dataset with reduced-dimensionality experiments.  AutoAssign performs the automated analysis of assignments in only seconds on current RISC and x86 platforms.

The following description is mainly taken from the AutoAssign SOP (tutorial).  Please check this tutorial and the version 2.4.0 help documentation for more information and usage.

Using AutoAssign

Input Files

The input files for AutoAssign are:

Opening AutoAssign

  • Run the startup script autoassign If you are running the client on the same machine as the server.
  • If you are running the client on a machine different from the server, then you must first make sure that the server is running on the other machine. The easiest way to do this is to run the startup script autoserver on this other computer.

The main AutoAssign window will appear (Figure 1).

Figure 1:  AutoAssign Main Window

Autoassign1.jpg

Creating a Data Set

Using the Peaklist Converter in the Graphical User Interface

The easiest way to create a dataset is to using the Dataset Creation Graphical User Interface. This interface will guide you through the basic steps of converting your peak lists, registering your peak lists, creating a control file, and evaluating the quality of your peak lists. Go to File -> Convert/Create Dataset from the main menu option to start the Dataset Creation Graphical User Interface (Figure 2).

Figure 2:  The Peak List Converter

Autoassign2.jpg

The process is divided into 4 steps:  1. Peak list conversion, 2. Registration, 3. Data set properties, and 4. Data set completion.

Peak list conversion

The peak list conversion process is designed to reformat given peak lists into AutoAssign format (Figure 3).  Here the user defines the columns and spectral properties of each peak list.

Figure 3:  Peak List Conversion Page

Autoassign3.jpg

  • Working directory [text field]: it is a base directory that the interface will use to create a subdirectory with a time stamp. The default is set to the current working directory.
  • Auto Fill [button]: when a user decides to reuse the same peak lists or the same type of configuration based on the previous analysis, this function allows a user to import the data previously used.
  • Add Experiment [button]: This function allows a user to display additional blank peaklist panel.
  • Clear All Entries [button]: this function clears populated data fields displayed on screens.
  • Convert Peak Lists [button]: this function runs a set of AutoAssign perl scripts to converts peak lists entered by a user. It generates intermediate files used to compute registration/tolerance values for the subsequent steps. All scripts executions run by interface is logged in event.log file found under the subdirectory named with a time stamp along with other intermediate files.

Registration

A user will be taken to the Registration tab when Convert Peak List button is clicked. This will allow a user to calculate registration values and apply those values to shift peaks by creating new / temporal peaklist files (Figure 4).

Figure 4: Peak List Registration Page

Autoassign4.jpg

  • Calculate Registration [button]: this function starts calculating registration value and it will open a progress monitor window to notify you how many calculations are completed. The error may be recorded in .reg file(s) under subdirectory if it occurs.
  • View Registration [button]: the interface will create an html page by consolidating .reg files generated and will try to open it with a browser. The Weighted Registration values are used to register peak lists. The Full Std values are used in calculating tolerances included in the control file (Figure 5).
  • Revert to Original Value [button]: this function refills text fields for registration value with original registration value calculated.
  • Apply Registration [button]: this function applies registration values to each peaklist. It creates new peaklist files with _shifted.pks extension.

Figure 5: Registration Results Page

Autoassign5.jpg

Dataset Properties

After registration, the converter moves to the Data Set Properties page (Figure 6). A user is asked to provide additional information to complete the control file creation. The value entered by a user on this tab will be used in the properties, tolerances and sequence sections in a control file.

  • Open Sequence File [button]: this function opens a window for a user to select a sequence file.
  • Save Properties/Create Control File [button]: this function saves dataset property information into an internal data structure and use it to produce a control file. This button should be clicked to produce an initial control file and the result will be displayed on the next tab.

Figure 6:  Data Set Properties Page

Autoassign6.jpg

Dataset Completion

A user can review the control file generated by the interface, manually edit its content, and then save it (Figure 7). To save a revised control file select the Rename Control File button. 

  • View Quality Report [button]: this function opens a window for a user to view quality assessment report on the dataset.
  • Read Control File [button]: this function allows a user to import an existing control file to view.
  • Revert to Original [button]: this function brings back original control file to the screen.
  • Rename Control File [button]: this function allows to save a control file with a different name.
  • Open Control File [button]: this function opens a control file for analysis. The default control file is the one with original values. When a user saved the control file, that file becomes the default.

Figure 7:  Peak List Converter Page

Autoassign7.jpg

Follow the Dataset Creation process that the interface guides you through. At the end, you will have a dataset in AutoAssign format.  Hitting the Open Control File button launches a new AutoAssign connectivity map window, and the user can proceed to computing resonance assignments.

Manually Preparation of Peak Files (UB)

One can manually prepare input files by using the follow format, UBNMR can also be used to prepare the input files for AutoAssign. In UBNMR run makeAutoList to generate input files for AutoAssign.


  • Peak File Format:
	<Index>     <dim1>     <dim2>     [dimX]     <Intensity>     <Label>[.PeakNotes]
	...
	*
  • Here is an example HNCA peak list file:
   	 #Index   Xppm     Yppm     Zppm     Intensity    Label.notes
   	 126     8.871      110.859   50.247   3242120        HNCA
   	 125     8.870      110.898   62.529   724463         HNCA.;g45
	 73       8.744      116.161   56.614   2287600       HNCA
   	 :
   	 :
    	145     9.153    112.004     57.415   2788050       HNCA   

where individual fields are separated by spaces and/or tabs, and comments are indicated by a "#" sign at the beginning of a line.

Performing Resonance Assignments

Connect to the server and open control-file

  • One can open a dataset to analyze start from the main menu by selecting File+ -> =Open Control File menu option or pressing the Open Control File button on the 4th tab (dataset completion) of the Dataset Creation Graphical User Interface.
  • AutoAssign will bring up the following window prompting the user to connect to a server (Figure 8):

Figure 8: Connection to Server

Autoassign8.jpg

If the client and server are on the same computer, you should be able to simply select "Connect".

  • Next a blank Connectivity Map window is opened (Figure 9).

Figure 9:  Connectivity Map

Starting a Backbone Resonance Assignment

Select the Execute -> Default Execution menu option to start the analysis.

  • Default Execution is the suggested method to run the program. The Refined Execution Execute menu option run AutoAssign in an aggressive mode which is not as safe as the Default Execution option.
  • The Connectivity Map will show the resonance assignment results as the analysis progresses. The whole analysis should only take a few seconds (Figure 10).

Figure 10:  Assigned Connectivity Map

Autoassign10.jpg


Assignment Constraints

The user can specify a variety of assignment constraints in the Notes column of a Sparky peak list using a controlled vocabulary.  These assignment constraints include: spin system typing, grouping and mapping constraints which are used by AutoAssign in the assignment process.  A complete description of assignment constraints and their usage in AutoAssign is provided in the online documentation.

Examining and Saving the Resonance Assignment Results

  1. The saving format options can be selected under the save -> Chemical Shift submenus.
  2. The Specific GS, Unassigned Residues, and Unassigned GSs Examine menu options are very valuable to troubleshoot problems with assignments or missing assignments. These can be as guides to reexamine spectra to find missing peaks or to apply peak-based assignment constraints if such interventions are necessary.
  3. The results file saved from Examine -> All GSs is used to convert the AutoAssign result to XEASY. 
  4. For individually assigned peak lists in Sparky format click:  Save -> All Sparky Peaklists.
  5. There are several options under Save -> Chemical Shifts for saving the assignments in bmrb formats (2.1 and 3.0) and connectivity map format (cmap).<span style="font-family: sans-serif;" /><span style="font-family: sans-serif;" />


The Connectivity Map Image Editor

A connectivity map image represents resonance and other experimental data on the protein sequence in a graphical form.  The CMap Image Editor (which can be launched under the Tools tab) allows one to create such an image (Figure 11).  This facility is particularly useful for generating cmap images for publication.  For a complete description of this editor and all of its functions see the on-line documentation.

Figure 11:  A CMap Image

Cmap image editor.gif

AutoPeak Scripts

The AutoAssign bin directory features a plethora of perl scripts written by Hunter Moseley for many functions related to AutoAssign and its interplay with various programs including Sparky, as well as interconversion between a variety of formats.  A glossary of these scripts is available and a more in-depth description of the more commonly used scripts in our structure determination process is available here.  Included among the major applications of the AutoPeak scripts is the creation and manipulation of chemical shift files in BMRB format.  Note that AutoPeak scripts in AutoAssign versions above 1.15.1 are written for BMRB 3.0 format. Here is a list of the scripts we most often use in our structure determinations:

  • bmrb2cmap.pl - converts bmrb format to cmap format. bmrbACS2SparkyPks.pl - converts bmrb assigned chemical shift save frame to Assigned Sparky Peak List format.
  • bmrb2AutoStructure.pl - converts bmrb format to AutoStructure Resonance List format.
  • compare_bmrb.pl - compares two bmrb files and returns how well they match.
  • missing_shifts.pl - reports the missing AtomTypes in a given BMRB file.
  • sparkyPks2bmrb.pl - converts a sparky peaklist into a BMRB assigned peak list save frame.
  • sparkyRL2bmrb.pl- converts Sparky resonance list into a BMRB file.


References

1.   Zimmerman, D.E., Kulikowski, C.A., Huang, Y., Feng, W., Tashiro, M.,  Shimotakahara, S., Chien, C., Powers, R. and Montelione, G.T. (1997) Automated analysis of protein NMR assignments using methods from artificial intelligence. J. Mol. Biol. 269, 592-610.

2.   Moseley, H.N.B., Monleon, D. and Montelione, G.T. (2001)  Automatic Determination of Protein Backbone Resonance Assignments from Triple Resonance NMR Data. Methods Enzymology 339, 91-108.



-- GaohuaLiu - 11 Jun 2007

-- Updated/Edited by JimAramini - 09 Nov 2009