TALOS: Difference between revisions

From NESG Wiki
Jump to navigation Jump to search
m (moved NESG:UsingTALOS to TALOS)
No edit summary
Line 1: Line 1:
== '''Finding Dihedral Angle Constraints with TALOS''' ==
== '''Introduction''' ==


TALOS (Torsion Angle Likelihood Obtained from Shift and sequence similarity) is a database system for empirical prediction of <tt>phi</tt> and <tt>psi</tt> backbone torsion angles from five kinds (HA, CA, CB, CO, N) of chemical shifts for a given protein sequence.  
TALOS (Torsion Angle Likelihood Obtained from Shift and sequence similarity) is a database system for empirical prediction of <tt>phi</tt> and <tt>psi</tt> backbone torsion angles from five kinds (HA, CA, CB, CO, N) of chemical shifts for a given protein sequence.  


For detailed information please check the [http://spin.niddk.nih.gov/NMRPipe/talos/ TALOS website].
For detailed information please check the [http://spin.niddk.nih.gov/NMRPipe/talos/ TALOS website].  


=== '''Step-by-Step Procedure''' ===
=== '''Using TALOS''' ===
 
#Create a subdirectory (for example, <tt>structure/cyana21/talos</tt>) and copy the latest sequence and atom list files there. It is convenient to have them named <tt>XXXX.seq</tt> and <tt>XXXX.prot</tt>, where <tt>XXXX</tt> is an NESG target ID or other protein name. When using CARA, export the chemical shifts as an atom list file in this directory.
#Create and init.cya in this directory as described in "[[NESG:CYANAInitFile|Creating an init.cya file for CYANA 2.1]]" or copy a previously used file.
#Start CYANA and type:


# Create a subdirectory (for example, <tt>structure/cyana21/talos</tt>) and copy the latest sequence and atom list files there. It is convenient to have them named <tt>XXXX.seq</tt> and <tt>XXXX.prot</tt>, where <tt>XXXX</tt> is an NESG target ID or other protein name. When using CARA, export the chemical shifts as an atom list file in this directory.
# Create and init.cya in this directory as described in "[[NESG:CYANAInitFile|Creating an init.cya file for CYANA 2.1]]" or copy a previously used file.
# Start CYANA and type:
      <nowiki>
       read prot XXXX.prot
       read prot XXXX.prot
       taloslist XXXX
       taloslist XXXX
      </nowiki>
 
# This will create the TALOS input file <tt>XXXX.tab</tt>. In a UNIX shell run <tt>talos.tcl -in XXXX.tab</tt>. During the database search, a series of files <tt>pred/res*.tab</tt> will be created. Each one of these files tallies the 10 best database matches for a given residue in the target protein. Before exiting, <tt>talos.tcl</tt> calls <tt>vina.tcl</tt> to create a file <tt>pred.tab</tt>, which includes an initial summary of the prediction results. The database search will typically take about 10-60 sec per residue in the target.
#This will create the TALOS input file <tt>XXXX.tab</tt>. In a UNIX shell run <tt>talos.tcl -in XXXX.tab</tt>. During the database search, a series of files <tt>pred/res*.tab</tt> will be created. Each one of these files tallies the 10 best database matches for a given residue in the target protein. Before exiting, <tt>talos.tcl</tt> calls <tt>vina.tcl</tt> to create a file <tt>pred.tab</tt>, which includes an initial summary of the prediction results. The database search will typically take about 10-60 sec per residue in the target.  
# In a UNIX shell run <tt>rama.tcl -in XXXX.tab</tt>. Here you can examine <tt>phi</tt> and <tt>psi</tt> distributions, choose database matches to be used in calculating predictions, and classify prediction results as <tt>Good</tt>, <tt>Ambiguous</tt> or <tt>Unclassified</tt> / <tt>New</tt>. See below for the guidelines for classifying prediction. Save your modifications in a new file, for example, <tt>talos.tab</tt>.
#In a UNIX shell run <tt>rama.tcl -in XXXX.tab</tt>. Here you can examine <tt>phi</tt> and <tt>psi</tt> distributions, choose database matches to be used in calculating predictions, and classify prediction results as <tt>Good</tt>, <tt>Ambiguous</tt> or <tt>Unclassified</tt> / <tt>New</tt>. See below for the guidelines for classifying prediction. Save your modifications in a new file, for example, <tt>talos.tab</tt>.  
# Start CYANA and type:
#Start CYANA and type:
<nowiki>
 
     talosaco pred    #or "talos.tab" -- use the appropriate filename
     talosaco pred    #or "talos.tab" -- use the appropriate filename
     write aco talos.aco
     write aco talos.aco
</nowiki>


See also the <tt>~/demo/details/TalosAngleRestraints.cya</tt> example script in your local CYANA 2.1 installation.
See also the <tt>~/demo/details/TalosAngleRestraints.cya</tt> example script in your local CYANA 2.1 installation.  


<br>


=== '''Manual Refinement of TALOS predictions''' ===
=== '''Manual Refinement of TALOS predictions''' ===


Guidelines for refining the TALOS output:
Guidelines for refining the TALOS output:  


* Classify prediction as <tt>Good</tt> only if
*Classify prediction as <tt>Good</tt> only if  
** All 10 best database matches fall in a "consistent" region of the Ramachandran map
**All 10 best database matches fall in a "consistent" region of the Ramachandran map  
** Or 9 out of 10 best database matches fall in a consistent region with <tt>phi < 0</tt>, and the one outlier also lies in <tt>phi < 0</tt> half of the map
**Or 9 out of 10 best database matches fall in a consistent region with <tt>phi &lt; 0</tt>, and the one outlier also lies in <tt>phi &lt; 0</tt> half of the map  
** Or 9 out of 10 of the best database matches fall in a consistent region with <tt>phi > 0</tt>
**Or 9 out of 10 of the best database matches fall in a consistent region with <tt>phi &gt; 0</tt>  
* Accept predictions which are classified as <tt>Good</tt>, whose residues are in beta-sheets or helices according to CSI (excluding the first and the last residue of a secondary structure element).
*Accept predictions which are classified as <tt>Good</tt>, whose residues are in beta-sheets or helices according to CSI (excluding the first and the last residue of a secondary structure element).


For ''de novo'' structure determination it is recommended to take the automatically generated TALOS constraints. Angular constraints outside of secondary structure elements (as determined by CSI) can be commented out in the <tt>talos.aco</tt> file.
For ''de novo'' structure determination it is recommended to take the automatically generated TALOS constraints. Angular constraints outside of secondary structure elements (as determined by CSI) can be commented out in the <tt>talos.aco</tt> file.  


During structure refinement you can refine TALOS predictions against a preliminary structure.
During structure refinement you can refine TALOS predictions against a preliminary structure.  


<nowiki>
<nowiki>
vina.tcl -in XXXX.tab -ref XXXX.pdb -auto
vina.tcl -in XXXX.tab -ref XXXX.pdb -auto
</nowiki>
</nowiki> and <nowiki>
and  
<nowiki>
rama.tcl -in XXXX.tab -ref XXXX.pdb.
rama.tcl -in XXXX.tab -ref XXXX.pdb.
</nowiki>
</nowiki>  
 
The <tt>XXXX.pdb</tt> file '''must''' have only one conformer. Thus, you may need to analyze the angle distributions in a molecular graphics package (e.g. MOLMOL).


The <tt>XXXX.pdb</tt> file '''must''' have only one conformer. Thus, you may need to analyze the angle distributions in a molecular graphics package (e.g. MOLMOL).


<br>


{| border="1"
{| border="1"
|-
|-
|Element||PHI||PSI
| Element  
| PHI  
| PSI
|-
|-
|a-helix||-60||-45
| a-helix  
| -60  
| -45
|-
|-
|b-sheet||-140||135
| b-sheet  
| -140  
| 135
|}
 
=== '''Desription of talosaco.cya Macro'''  ===
 
Invoked as <br> <nowiki>
talosaco file [factor [width]] </nowiki> <br> Here <tt>file</tt> is the TALOS prediction output, <tt>width</tt> is the threshold minimum width for <tt>PHI/PSI</tt> angle distributions, and <tt>factor</tt> is used to scale the width of a distribution when creating an angle constraint. Both <tt>width</tt> and <tt>factor</tt> arguments are optional. By default, <tt>width=20.0</tt> and <tt>factor=2.0</tt>.
 
This macro will create angle constraints for a given residue only if the prediction is classified as "Good" and the residue is not a proline.
 


|}


=== '''Desription of talosaco.cya Macro''' ===
References


Invoked as <br/> <nowiki>
talosaco file [factor [width]] </nowiki> <br/> Here <tt>file</tt> is the TALOS prediction output, <tt>width</tt> is the threshold minimum width for <tt>PHI/PSI</tt> angle distributions, and <tt>factor</tt> is used to scale the width of a distribution when creating an angle constraint. Both <tt>width</tt> and <tt>factor</tt> arguments are optional. By default, <tt>width=20.0</tt> and <tt>factor=2.0</tt>.


This macro will create angle constraints for a given residue only if the prediction is classified as "Good" and the residue is not a proline.


%COMMENT%
-- AlexEletski - 7 Jun 2007


-- Main.AlexEletski - 7 Jun 2007
-- Updated by JimAramini - 11 Nov 2009

Revision as of 05:00, 11 November 2009

Introduction

TALOS (Torsion Angle Likelihood Obtained from Shift and sequence similarity) is a database system for empirical prediction of phi and psi backbone torsion angles from five kinds (HA, CA, CB, CO, N) of chemical shifts for a given protein sequence.

For detailed information please check the TALOS website.

Using TALOS

  1. Create a subdirectory (for example, structure/cyana21/talos) and copy the latest sequence and atom list files there. It is convenient to have them named XXXX.seq and XXXX.prot, where XXXX is an NESG target ID or other protein name. When using CARA, export the chemical shifts as an atom list file in this directory.
  2. Create and init.cya in this directory as described in "Creating an init.cya file for CYANA 2.1" or copy a previously used file.
  3. Start CYANA and type:
     read prot XXXX.prot
     taloslist XXXX
  1. This will create the TALOS input file XXXX.tab. In a UNIX shell run talos.tcl -in XXXX.tab. During the database search, a series of files pred/res*.tab will be created. Each one of these files tallies the 10 best database matches for a given residue in the target protein. Before exiting, talos.tcl calls vina.tcl to create a file pred.tab, which includes an initial summary of the prediction results. The database search will typically take about 10-60 sec per residue in the target.
  2. In a UNIX shell run rama.tcl -in XXXX.tab. Here you can examine phi and psi distributions, choose database matches to be used in calculating predictions, and classify prediction results as Good, Ambiguous or Unclassified / New. See below for the guidelines for classifying prediction. Save your modifications in a new file, for example, talos.tab.
  3. Start CYANA and type:
   talosaco pred    #or "talos.tab" -- use the appropriate filename
   write aco talos.aco

See also the ~/demo/details/TalosAngleRestraints.cya example script in your local CYANA 2.1 installation.


Manual Refinement of TALOS predictions

Guidelines for refining the TALOS output:

  • Classify prediction as Good only if
    • All 10 best database matches fall in a "consistent" region of the Ramachandran map
    • Or 9 out of 10 best database matches fall in a consistent region with phi < 0, and the one outlier also lies in phi < 0 half of the map
    • Or 9 out of 10 of the best database matches fall in a consistent region with phi > 0
  • Accept predictions which are classified as Good, whose residues are in beta-sheets or helices according to CSI (excluding the first and the last residue of a secondary structure element).

For de novo structure determination it is recommended to take the automatically generated TALOS constraints. Angular constraints outside of secondary structure elements (as determined by CSI) can be commented out in the talos.aco file.

During structure refinement you can refine TALOS predictions against a preliminary structure.

vina.tcl -in XXXX.tab -ref XXXX.pdb -auto and rama.tcl -in XXXX.tab -ref XXXX.pdb.

The XXXX.pdb file must have only one conformer. Thus, you may need to analyze the angle distributions in a molecular graphics package (e.g. MOLMOL).


Element PHI PSI
a-helix -60 -45
b-sheet -140 135

Desription of talosaco.cya Macro

Invoked as
talosaco file [factor [width]]
Here file is the TALOS prediction output, width is the threshold minimum width for PHI/PSI angle distributions, and factor is used to scale the width of a distribution when creating an angle constraint. Both width and factor arguments are optional. By default, width=20.0 and factor=2.0.

This macro will create angle constraints for a given residue only if the prediction is classified as "Good" and the residue is not a proline.


References


-- AlexEletski - 7 Jun 2007

-- Updated by JimAramini - 11 Nov 2009