RDCvis & KiNG: Difference between revisions

From NESG Wiki
Jump to navigation Jump to search
No edit summary
No edit summary
Line 122: Line 122:
with the possible RDC curve shapes.<span>&nbsp;</span></span>  
with the possible RDC curve shapes.<span>&nbsp;</span></span>  


IMAGE&nbsp;<span>These varied curve shapes
IMAGE&nbsp;
 
<span />
 
<span />
 
<span>These varied curve shapes
arise from the intersection of the surface representing the possible solutions
arise from the intersection of the surface representing the possible solutions
for the RDC equation with the sphere representing the possible positions for a given
for the RDC equation with the sphere representing the possible positions for a given
Line 146: Line 152:


Translational overlap while maintaining orientation in order to investigate the match of the internuclear vector across all models in the ensemble to the target curves of the measured RDC  
Translational overlap while maintaining orientation in order to investigate the match of the internuclear vector across all models in the ensemble to the target curves of the measured RDC  
In general, even the most well defined NMR ensembles will have enough deviation from model to model that a close-up comparison of the behavior of residues is difficult with an overall superposition.<span>&nbsp; </span>When all models are visible, the visual clutter from all of the models is too overwhelming for reasonable analysis.<span>&nbsp; </span>Viewing models one at a time resolves the issue of clutter, but it is still difficult to compare one model to the others.<span>&nbsp;
</span>On-demand local superimposition of the models is one possible solution.<span>&nbsp; </span>However, for visualizing RDC data, which is directly related to the global orientation of the model, any rotation of the models would alter the relationship of the model to the RDCs.<span>&nbsp; </span>Therefore, the co-centering tool translates all the models onto a single point with no rotational aspect in order to maintain the global orientation of the models.<span>&nbsp;&nbsp;</span>
In the majority of cases, co-centering reduces the visual clutter dramatically, allowing for a meaningful observation about the model-by-model agreement of the internuclear bond vector to the RDC curves and a visual assessment the match of the model to the data in the local context.<span>&nbsp; </span>There are some situations where the co-centering may not be enough help.<span>&nbsp; </span>Particularly, in regions where there is a limited amount of experimentally observed data, the different models of the ensemble may have wildly different conformations, which makes co-centering less effective.&nbsp;


==== Wal through of co-centering tool - screenshots  ====
==== Wal through of co-centering tool - screenshots  ====
Line 166: Line 178:


Implications for flexibility of the internuclear vector  
Implications for flexibility of the internuclear vector  
There exists some variation in the internuclear vector match to the RDC data drawn as a curve.<span>&nbsp; </span>This flexibility can result in a fanning out of the internuclear vector along a target curve.<span>&nbsp;
</span>The likely contributors to this variation are the “orientation dependent variability,” and the error model of observed RDC’s.<span>&nbsp; </span>I will later discuss modeling the error of the observed RDCs.<span>&nbsp; </span>
Generally, the orientation of the alignment tensor to the molecule (and its rhombicity) will determine the shape of the Saupe curves at each internuclear bond vector where they are experimentally observed.<span>&nbsp; </span>In addition, the orientation of the local structural features of the molecule in relation to the given Saupe curve shape will determine the amount and direction of variation allowable for structural interpretation.<span>&nbsp; </span>
Both orientation of the tensor to the molecule and orientation of the local structural features in relation to the curve interact with one another to impact the potential structural interpretations. For example, if a curve is relatively flat, a peptide rotation approximately around the C
direction could swing the NH bond vector along the curve if the curve tangent has the right relationship to the C.
An orientation dependent variability should not be taken to imply dynamics.  Rather, it demonstrates that for a given RDC in a local area, there is an arc along which multiple positions remain consistent with the data because of the orientation and shape of the RDC curve in the local environment of the structure model.


===== One Curve Rule  =====
===== One Curve Rule  =====
Line 174: Line 198:


Special case where target curves overlap  
Special case where target curves overlap  
The usual methodology for use of RDCs in structure solution implicitly assumes one conformation, so that in general a given bond vector should only line up with one or the other curve.<span>&nbsp; </span>Motion or multiple conformations are of course possible, and for loops even probable, but this is not the way to identify such motion.<span>&nbsp;
</span>It would require an extremely unlikely coincidence for a motion or conformational change to line up each of two orientational clusters for a given internuclear vector exactly on a different one of the two curves.<span>&nbsp; </span>Even if a residue were sampling conformations that could match both curves, this would result in averaging of the RDC and end up with a different, smaller RDC value.<span>&nbsp; </span>This averaging affect has been treated in the literature, where others have tried to develop a model of conformational sampling that stays in agreement with the observed RDC’s (Clore, 2004; Brooks, 1997; Hess, 2003).<span>&nbsp; </span>We conclude that this behavior in a structure ensemble - of two distinctly different conformations pointing the internuclear vector towards opposite curves - is a potential systematic error allowed by the usual procedure of requiring each individual model to match the scaler RDC value, without considering the relationship of the models to one another or to the target curves.


===== Error Model Issues  =====
===== Error Model Issues  =====
Line 199: Line 226:
Orthogonal criteria that can give the user an inditation of the local quality of the ensemble of models and identify areas where fixes may need to be made.  
Orthogonal criteria that can give the user an inditation of the local quality of the ensemble of models and identify areas where fixes may need to be made.  


== Using RDCvis in iterative refinement of NMR structures ==
== Using RDCvis in iterative refinement of NMR structures ==


=== Further restraining a structure using RDCvis and other information ===
=== Further restraining a structure using RDCvis and other information ===


==== Example: &nbsp;CcmE ====
==== Example: &nbsp;CcmE ====


One RDC v 2RDCs
One RDC v 2RDCs  


CNS v Xplor-NIH
CNS v Xplor-NIH

Revision as of 18:47, 1 November 2011

 This page is under construction as of November 1st, 2011.  


Draft Outline

Intro to RDCvis

Visualizing the RDC curves in their structural context, especially when combined with other structure quality visualizations allows users to easily identify and study areas of their models which need improvement.

Software for generating RDC visualizations, dubbed RDCvis and built into KiNG (Chen, 2009), requires a PDB format coordinate file and an NMR restraints file (in CNS format) with RDC data.  RDCvis outputs the RDC visualizations in kinemage format (Richardson, 1992), as a standalone file that is routinely appended onto an existing multi-model kinemage for viewing in KiNG. These curves plotted using the kinemage graphics format, take advantage of the powerful and extensive infrastructure that already exists for manipulating and viewing kinemages in Mage, KiNG, and KinImmerse (Richardson, 1992; Chen, 2009; Block, 2009).


RDCvis draws RDC curves by using singular value decomposition (SVD) (losonczi1999) to calculate a Saupe alignment matrix (saupe1968) from the RDCs.  The method is described in more detail in the Supporting Information.

RDCvis allows users to visualize and interact with the possible RDC curve shapes. 

IMAGE 

These varied curve shapes arise from the intersection of the surface representing the possible solutions for the RDC equation with the sphere representing the possible positions for a given internuclear bond vector.  IMAGE


Getting RDCvis to work in KiNG

Walk-through of loading RDCs

What you need -

a) MolProbity multimodel-multicrit kinemage

Multiple models visualized at once with the local geometric and steric validation criteria from the Richardson lab displayed at each residue.

b) .tbl file (note on acceptible formats

One note is that a significant barrier to using RDCvis is the lack of consistency in the deposited NMR restraints files.  A more strictly defined standard data-file format would make RDCvis more straightforward to use and thus routinely useful to a wider community. 

Loading the files - screenshots

Co-centering tool

Translational overlap while maintaining orientation in order to investigate the match of the internuclear vector across all models in the ensemble to the target curves of the measured RDC

In general, even the most well defined NMR ensembles will have enough deviation from model to model that a close-up comparison of the behavior of residues is difficult with an overall superposition.  When all models are visible, the visual clutter from all of the models is too overwhelming for reasonable analysis.  Viewing models one at a time resolves the issue of clutter, but it is still difficult to compare one model to the others.  On-demand local superimposition of the models is one possible solution.  However, for visualizing RDC data, which is directly related to the global orientation of the model, any rotation of the models would alter the relationship of the model to the RDCs.  Therefore, the co-centering tool translates all the models onto a single point with no rotational aspect in order to maintain the global orientation of the models.  

In the majority of cases, co-centering reduces the visual clutter dramatically, allowing for a meaningful observation about the model-by-model agreement of the internuclear bond vector to the RDC curves and a visual assessment the match of the model to the data in the local context.  There are some situations where the co-centering may not be enough help.  Particularly, in regions where there is a limited amount of experimentally observed data, the different models of the ensemble may have wildly different conformations, which makes co-centering less effective. 


Wal through of co-centering tool - screenshots

Using RDCvis to analyze RDCs in their local context

Analysis with one or multiple RDCs

Just like others have predicted, it is better with multiple RDCs

Even if the tensors are very similar, it can still give useful data to have two RDCs

Philosophically similar to crystallography in that it is looking at validation crteria mapped on a model while also looking at the experimental data (in this case RDCs, in xray it is the density)

Patterns to look for

Orientation Dependent Variability

Contribution from both the orientation of the local structure to the RDC visualized and the RDC visualized to the field.

Implications for flexibility of the internuclear vector

There exists some variation in the internuclear vector match to the RDC data drawn as a curve.  This flexibility can result in a fanning out of the internuclear vector along a target curve.  The likely contributors to this variation are the “orientation dependent variability,” and the error model of observed RDC’s.  I will later discuss modeling the error of the observed RDCs. 

Generally, the orientation of the alignment tensor to the molecule (and its rhombicity) will determine the shape of the Saupe curves at each internuclear bond vector where they are experimentally observed.  In addition, the orientation of the local structural features of the molecule in relation to the given Saupe curve shape will determine the amount and direction of variation allowable for structural interpretation. 

Both orientation of the tensor to the molecule and orientation of the local structural features in relation to the curve interact with one another to impact the potential structural interpretations. For example, if a curve is relatively flat, a peptide rotation approximately around the C

direction could swing the NH bond vector along the curve if the curve tangent has the right relationship to the C.


An orientation dependent variability should not be taken to imply dynamics. Rather, it demonstrates that for a given RDC in a local area, there is an arc along which multiple positions remain consistent with the data because of the orientation and shape of the RDC curve in the local environment of the structure model.


One Curve Rule

It can never be on both...

Searching for these systematic errors

Special case where target curves overlap

The usual methodology for use of RDCs in structure solution implicitly assumes one conformation, so that in general a given bond vector should only line up with one or the other curve.  Motion or multiple conformations are of course possible, and for loops even probable, but this is not the way to identify such motion.  It would require an extremely unlikely coincidence for a motion or conformational change to line up each of two orientational clusters for a given internuclear vector exactly on a different one of the two curves.  Even if a residue were sampling conformations that could match both curves, this would result in averaging of the RDC and end up with a different, smaller RDC value.  This averaging affect has been treated in the literature, where others have tried to develop a model of conformational sampling that stays in agreement with the observed RDC’s (Clore, 2004; Brooks, 1997; Hess, 2003).  We conclude that this behavior in a structure ensemble - of two distinctly different conformations pointing the internuclear vector towards opposite curves - is a potential systematic error allowed by the usual procedure of requiring each individual model to match the scaler RDC value, without considering the relationship of the models to one another or to the target curves.

Error Model Issues

The error model is too tight

The error model is not representative of the error in measurement

At the poles it isn't particularly helpful


Using other helpful data

Talos+ restraints

Use of Talos+ restraints to assist in restraining the backbone appropriately

Order parameters

Helpful for understanding the potential variability actually observed at a residue (and perhaps making the argument for not using as many restraints or explaining why the behavior of the ensemble of models at that point is peculiar)

Sterics and geometry from MolProbity

Orthogonal criteria that can give the user an inditation of the local quality of the ensemble of models and identify areas where fixes may need to be made.

Using RDCvis in iterative refinement of NMR structures

Further restraining a structure using RDCvis and other information

Example:  CcmE

One RDC v 2RDCs

CNS v Xplor-NIH