Resonance Assignment/Abacus/Protein Sequence format

From NESG Wiki
Revision as of 21:47, 26 November 2009 by AlexLemak (talk | contribs)
Jump to navigation Jump to search

1. Fasta format.

The first line shoul start with '>' symbol.

Next one or more lines contain sequence in 1-letter code.


>MDSKEVLVHVKNLEKNKSNDAAVLEILHVLDKEFVPTE
 KLLRETKVGVE VNKFKKSTN VEISKLVKKMISSWKDAIN 


Example 1:

>MDSKEVLVHVKNLEKNKSNDAAVLEILHVLDKEFVPTE KLLRETKVGVE VNKFKKSTN VEISKLVKKMISSWKDAIN




2. "Standard" format.

Each line contains name of one residue in 3-lettr code and, optionally, the residue ID.

(only residue ID of the first residue is used to start numbering of the sequence positions).




Example 2.1 (position ID is not specified)

GLN GLY HIS MET PRO GLY ILE ILE TYR GLU GLY LYS GLY THR ASN MET GLU ....

Example 2.2 (with specified all position ID):

GLN -3 GLY -2 HIS -1 MET 0 PRO 1 GLY 2 ILE 3 ILE 4 TYR 5 .....

Example 2.3 (with specified first position ID):

GLN -3 GLY HIS MET PRO GLY ILE ILE TYR .....