Resonance Assignment/Abacus/FMCGUI commands
Jump to navigation
Jump to search
3. Data Menu
2. Project Menu
Project>New
:Start a new project.
User have to provide a name of the project PROJECTNAME,and to select a directory that will host the project. The project root directory with the same name PROJECTNAME is created.
Project>Load
: Continue to work on previously saved project.
User have to select file PROJECTNAME.prj in the directory PROJECTNAME, where PROJECTNAME is the name of the root directory of the project.
Project>Save
: To save the current state of the project.
What is currently in the computer memory is saved in the file PROJECTNAME.prj located in the root directory of the project.
Poject>Quit
- To save the current state of the project and to quit.
3. Data Menu
The DATA section serves to load&save the data (such as protein sequence and peak lists). Since there are different formats of data-files that could be loaded in memory or saved on disk, one can use this section as format converter as well.
Data>Protein Sequence>Load
: To load a protein sequence into memory.
The input formats:
- 1-letter code (fasta format);
- 3-letter code (standard format).
User have to select the file with sequence and to specify the first residue ID in case it is not specified in the input file.
If there is His-tag in the sequence file, it is recommended to set the first residue ID to a negative number such that the first residue of a protein has ID of 1.
Data>Protein Sequence>Save as
: To save protein sequence in the file on disk.
The output formats:
- 1-letter code (fasta format);
- 3-letter code ("standard" format, for cyana)
- 3-letter code (for AutoStructure);
- 3-letter code (for RCI);
There are separate buttons for different peak lists.
Data>N15 NOESY>
=== Data>C13 NOESY>
Data>Arom NOESY>
Data>N15 HSQC>
Data>C13 HSQC>
Data>HNCA>
=== Data>HNCO>
Data>CBCACONHN>
Data>HBHACONH>
: Load or save a peak list.
Input and output formats:
- Sparky;
- Xeasy;
- Standard
Data>Tolerances
: To set tolerances for chemical shift matching in different spectral dimensions.
Fragment>Load>assigned
: To load assigned chemical shifts (spin-systems) in the memory.
Prerequisites:
- Loaded sequence
Input formats:
- assigned AA-fragments in standard format;
- CYANA chemical shift file (prot-file);
Fragment>Load>PB fragments
: To load unassigned spin-systems in the memory.
Input format :
PB-fragments in standard format.
Fragment>Save>PB fragments
: To save PB-fragments in a file on disk.
Output format:
PB-fragments in standard format.
The name of the saved file and it’s location are specified by user.
There are 3 options to save PB-fragments in the file:
- in order of fragments index, that is in the order by which fragments are stored in memory;
- in order of fragments user ID, U_id;
- in order of fragments assignment ID, A_id. In this case 2 files are saved. One file, with user specified name 'user_name', contains only fragments assigned to protein sequence positions, that is to positions with residue ID of >= 1. The second file, with the name 'user_name_na', contains all not assigned fragments (that is fragments with A_id = -99).
Fragment>Save>cyana
: To save assigned chemical shifts (that is fragments with A_id >0 ) in CYANA format.
Fragment>Save>bmrb
: To save assigned chemical shifts in the format suitable for BMRB deposition (star2.1)
Fragment>Save>talos
: To save assigned chemical shifts in the format suitable for TALOS/CS-Rosetta;
Fragment>Save>abacus
:To save unassigned PB-fragments in the format suitable for BACUS;
Fragment>Create>fawn
: To create/evaluate bPB-fragments.
Prerequisites:
- loaded in memory referenced peak lists of CBCA(CO)HN, HBHA(CO)HN, N15HSQC, and HNCA spectra;
- Specified tolerances.
There are two steps in executing this command.
On the first step, a fake C13HSQC peak list is created and shown in the popped up window “fake C13HSQC” (see Figure 2.2)
User can use the information shown in the main FMCGUI window and check /edit the list in the entry section of “fake C13HSQC” window. Pressing OK will result in loading the peak list from the entry window into memory as C13HSQC peak list.
On the second step, a number of bPB-fragments corresponding to 20 different amino acid types are generated from user-identified spin-systems. Each generated bPB-fragment is evaluated by a score S(T) that measure how good the spin-system chemical shifts match corresponding statistical chemical shifts derived from BMRB database (see Figure 1.2). The bPB-fragment with highest score is selected to form a list of bPB-fragments.
In the result, a new window ‘Create Fragment’ pops up and warning messages of the ‘sps_create’ script are shown in the main FMCGUI window (see Figure 2.3).
The window consists of three sections. The left sections contains suggested bPB-fragments, while the other sections contains two reports of fragments scoring with both C and H resonances and with only C resonances, respectively. Following the warning messages shown in the main FMCGUI window (see Figure 2.4), user can accept/modify generated bPB-fragments. Alternatively, when ‘poor’ bPB-fragments are present, user can go back to spectra, fix the pick lists accordingly, and repeat the fragment generation again.
User-approved bPB-fragments will be loaded in the memory by pressing OK button.
Results:
- C13HSQC peak list loaded in memory
- bPB-fragments are loaded in memory
Frgment>create>abacus
Prerequisites:
- loaded in memory referenced C13HSQC, N15HSQC, and HNCA peak lists and not referenced CBCA(CO)HN peak list; ( as an option, HNCA peak list could be not referenced as well)
- Specified tolerances.
On the second step, a number of PB-fragments corresponding to 20 different amino acid types are generated from user-identified spin-systems. Each generated PB-fragment is evaluated by a score S (see Figure 1.2) that measure how good the spin-system chemical shifts match corresponding statistical chemical shifts derived from BMRB database. The PB-fragment with highest score is selected to form a list of PB-fragments.
Spin-system which have Smax less than 10-4 are reported in the main FMCGUI window (see Figure 2.4). Following these warnings user can accept or to modify generated PB-fragments in the left section of “Create Fragment’ window (see Figure 2.5). Alternatively, user can go back to spectra, fix the pick lists accordingly, and repeat the fragment generation again.
User-approved bPB-fragments will be loaded in the memory by pressing OK button
Results:
- PB-fragments are loaded in memory
Fragment>Type>Calculate>
<div : Probabilistic typing of bPB-fragments (fawn) or PB-fragments (abacus) . </div>
Prerequisites:
- loaded in memory protein sequence
- loaded in memory PB-fragments
- specified tolerances.
Results:
- Fragment typing probabilities are calculated and loaded in memory.
The main FMCGUI window displays (see Figure 2.6):
- the summary table that shows how many fragments of each AA-residue type are expected and how many fragments were actually recognized by the typing script;
- warning messages that suggest user to check and possibly modify typing manually of some fragments manually
Fragment>Type>fix
: To modify typing probabilities .
Prerequisites:
- loaded in memory protein sequence
- loaded in memory PB-fragmen
New window "Fragment Property Modification" (FPM) window is opened (see Figure 2.7). This window has 3 sections.
In the top section of FPM window user can select fragment user ID, U_id. Then typing probabilities for all AA types t will be shown on the graph. The chemical shifts of the fragments and its assignment status (A_id) are shown as well. User can modify typing probabilities of the selected fragment by selecting AA types by clicking right mouse button and pressing ‘Update’ button. In the result only propapbilities corresponding to the selected AA types will be set to the same non-zero values.
In the middle section of FPM window user can select amino acid residue type (see Figure 2.8). Then the graph will show typing probabilities that correspond to the selected residue type for all available fragments IDs f . Selecting a particular fragment U_id by clicking right mouse button (the color of U_id is changed to red) and pressing “Update” button will set the probability to 1 while for all other f will be set to 0. Selecting a particular fragment U_id by clicking left mouse button (the color of U_id is changed to blue) and pressing “Update” button will set the probability to 0.
Fragment>Expected Peaks>
: To generate different peak lists expected from covalent structure of fragments.
Prerequisites:
- protein sequence is loaded in memory
- PB-fragments are loaded in memory
Fragment>Modify assigned
: To correct assigned fragments.
Prerequisites:
- loaded in memory protein sequence
- loaded in memory PB-fragments
- loaded HNCO, CBCACONH, and HNCA peak lists.
- specified tolerances.
In the result CO chemical shifts are added and chemical shift names are corrected for PB-fragments which are assigned (that is which has A_id > -99 )
Assignment>Contacts>HNCA
: To calculate fragments contact map.
Prerequisites:
- PB-fragments are loaded in memory ;
- typing probabilities are calculated;
- HNCA peak list (recommended to be referenced) is loaded in memory.
- tolerances are specified.
In the result, the contact map is calculated and loaded in the memory.
Assignment>Contacts>NOE>fawn
: To calculate fragments contact map.
Prerequisites:
- PB-fragments are loaded in memory;
- typing probabilities are calculated;
- N15_NOESY peak list is loaded in memory.
- tolerances are specified.
In the result, the contact map is calculated using N15 NOESY peak list and loaded in memory.
Assignment>Contacts>NOE>abacus
: To calculate both and fragments contact maps.
Prerequisites:
- PB-fragments are loaded in memory;
- typing probabilities are calculated;
- N15_NOESY and C13_NOESY peak lists are loaded in memory.
- tolerances are specified.
In the result, the contact map is calculated using only N15_NOESY peak list while contact map is calculated using both N15_NOESY and C13_NOESY peak lists that are interpreted by BACUS procedure. Both calculate maps are loaded in memory.
Assignment>Calculate Probabilities>SA
: To calculate assignment probabilities.
Prerequisites:
- protein sequence is loaded in memory;
- PB-fragments are loaded in memory;
- typing probabilities are calculated;
- fragments contact map is calculated;
- at least one of and fragments contact maps is calculated;
Probabilistic mapping of PB-fragments onto protein sequence is performed using Simulated Annealing Monte Carlo simulations.
A new window “Calculate SA” is open were user can specify different parameters in the control file of the SA simulations (see Figure 2.9).
The main parameters to consider are:
- “Name of the SA run”. Normally the name is sa_run#. A new directory under this name will be created within PROJECTNAME/assign directory. SA calculations will be curried out and the results will be stored in this directory.
- “Size of the pool for unassigned fragments”. The number of positions that are appended to the protein sequence and discarded (unassigned) fragments, if there are any, will be located there. It is safe to over-estimate this number. (If this number is under-estimated, this will force the mapping of spurious spin-systems onto protein sequence);
- “Number of SA trajectories”. The time needed for calculations is proportional to this number. On the other hand, having more SA trajectories the assignment probabilities could be calculated more accurately. In the case of good data, when all SA trajectories converge to assignments with the same energy, 10-15 trajectories should be enough. In the case of poor data, it is better to calculate 40-50 SA trajectories.
- “NOE bbcmap type”. User should specify which one NOE contact map, (abacus) or (fawn) should be used in the calculations;
- “Fixing position flag”. If the flag is set to 1, sequence position of all fragments which has assignment ID > -99 will be fixed during the simulation.
- “Final temperature”. Setting the optimal final temperature will provide all SA trajectories converge to optimal or sub-optimal assignments (the assignments that are in the vicinity of the global energy minimum). The optimal final temperature could be find by running one or a few SA runs with 3-4 trajectories and by analysing convergence of the trajectories from the report shown in the main FMCGUI window after each run (see Figure 2.10).
.
In the result of the SA calculations assignment probability map is
calculated and loaded in memory. The map is also saved in the file 'sa.probmap' located into PROJECTNAME/assignment/sa_run# directory.
[Assignment>Calculate Probabilities>REM]
<div </div><div </div>
: To calculate assignment probabilities.
Prerequisites:
- protein sequence is loaded in memory;
- PB-fragments are loaded in memory;
- typing probabilities are calculated;
- fragments contact map is calculated;
- at least one of and fragments contact maps is calculated;
Probabilistic mapping of PB-fragments onto protein sequence is performed using Replica Exchange Method Monte Carlo simulations.
A new window “Calculate REM” is open were user can specify different parameters in the control file of the REM simulations (see Figure 2.11)
The main parameters to consider are:
- “Name of the REM run”. Normally the name is rem_run#. A new directory under this name will be created within PROJECTNAME/assign directory. REM calculations will be curried out and the results will be stored in this directory.
- “Size of the pool for unassigned fragments”. The number of positions that are appended to the protein sequence and discarded (unassigned) fragments, if there are any, will be located there. It is safe to over-estimate this number. (If this number is under-estimated, this will force the mapping of spurious spin-systems onto protein sequence);
- “Number of REM steps”. The time needed for calculations is proportional to this number. On the other hand, with more REM steps more extensive sampling of assignment space wil be achieved, which in turn results in more accurate assignment probabilities. This number should be increased for large proteins.
- “NOE bbcmap type”. User should specify which one NOE contact map, (abacus) or (fawn) should be used in the calculations;
- “Fixing position flag”. If the flag is set to 1, sequence position of all fragments which has assignment ID > -99 will be fixed during the simulation.
- “Low temperature”. The optimal low temperature will provide extensive sampling of should sub-optimal assignments during REM simulation.
User can check a correct setting of the low temperature as well as the number of REM steps by analysing a report shown in the main FMCGUI window after the calculations are done.
In the result, the 50 lowest energy assignments are used to calculate assignment probabilities . The probabilities are loaded in memory and saved in the file ‘rem.prbmap’ located into PROJECTNAME/assign/rem_run# directory.
[Assignment>Fix Assignment>Using Probability map]
<div </div>
: To perform sequence specific assignment of PB-fragments using results of SA or REM calculations.
Prerequisites:
- protein sequence is loaded in memory;
- PB-fragments loaded in memory;
- at least one SA or REM calculations of assignment probabilities was done
Calculation of assignment probabilities with FMCGUI could be repeated a few times using different methods and parameters. Results of each calculation are stored in a separate directory with the user-specified name. Therefore, there could be a few different directories (for example, sa_run1, sa_run2, rem_run0, rem_run1, rem_run2) located within PROJECTNAME/assign/ directory that contain different assignment probability maps.
User will be asked to select the calculation directory (sa_run# or rem_run#) and to specify the value of acceptance probability P_a. Normally, P_a =0.9 is appropriate. A fragment is considered to be assigned to a sequence position if the corresponding assignment probability (taken from the selected directory) is >= P_a.
In the result, part of the PB-fragments will be assigned, namely, their assignment IDs A_id will be specified. The assignment report will be shown in the main FMCGUI window (see Figure 2.13) and saved in the corresponding simulation directory (‘sa.fix ‘or ‘rema.fix’ files) as well. For each sequence position, the IDs of both unambiguously and ambiguously assigned fragments are shown in the report. The list of discarded fragments is also presented (see Figure 2.13).
[Assignment>Fix Assignment>Manually]
<div </div>
: To perform sequence specific assignment of PB-fragments manually.
Prerequisites:
- protein sequence is loaded in memory;
- PB-fragments are loaded in memory;
This command allows user to fix sequence position of individual fragments.
“Fragment Property Modification” window will be open (see Figure 2.14). User can select a fragment in the bottom section of the window and the ingormation regarding the fragment assignemts will be displayed in this section. Namely, the graph shows assignment probabilities ( or ) that are currently loaded in memory and the text part shows the chemical shifts making up the fragment and it’s assignment ID (A_id).
To modify the current fragment assignment user have to select sequence position on the graph (by clicking on it by mouse) and then to press ‘Update’ button.
In the result, Assignment ID of the selected fragment will be set to the selected sequence position.
There are two special positions “U” and “B” shown on the graph at the end of the protein sequence (see Figure 2.15)
Selecting “U” and pressing “Update” button results in changing assignment status of the fragments to Unassigned, that is A_id is set to -99.
Selecting “B” and pressing “Update” button results in fixing fragment position in the pool of discarded fragments.
[Assignment>Fix Assignment>Reset all]
<div </div>
: To change assignment status of all fragments to “Unassigned”.
Prerequisites:
- protein sequence is loaded in memory;
- PB-fragments are loaded in memory;
In the result, for all fragments A_id is set to -99.
Assignment>Load Probabilities
: To load assignment probabilities in memory.
Prerequisites:
- protein sequence is loaded in memory;
- PB-fragments are loaded in memory;
- SA / REM calculations of assignment probabilities was done
User have to select a directory where SA or REM calculations were done (sa_run# or rem_run#).
The assignment probabilities from the selected directory will be loaded in memory.