Skip Header

 

Release 9.5

Published January 23, 2007

Headlines

Reintroduction of the initiator methionine

In UniProtKB/Swiss-Prot, the sequence data corresponds to the precursor form of a protein, i.e. before post-translational modifications such as cleavage of the signal peptide or other processing. However, for historical reasons, a notable exception was made: when the initiator methionine was post-translationally removed, the sequence stored in UniProtKB/Swiss-Prot did not include the methionine and instead started with the second residue.

As a consequence, our sequence data differed from that shown in other sequence databases where the initiator methionine is usually not removed. This discrepancy was confusing for users and was the subject of one of the most frequently asked questions to UniProtKB/Swiss-Prot.

This is no longer the case. With this release, all initiator methionines have been reintroduced to the UniProtKB/Swiss-Prot entries (over 10'000) from which it is cleaved. This caused a major change, since all amino acid positions described in these entries have now been updated to reflect the new sequence numbering.

The cleavage of the initiator methionine is still indicated by the INIT_MET line in the feature table but the sequence position is 1 instead of 0. We also added the comment Removed in the description field of INIT_MET line to indicate that the initiator methionine is indeed removed post-translationally.

Example P51487:

Previous format:

FT   INIT_MET      0      0       
FT   CHAIN         1    400       Phosrestin-1.
...
SQ   SEQUENCE   400 AA;  44781 MW;  DA786D7E9FFB4A29 CRC64;
      VVSVKVFKK ATPNGKVTFY LGRRHFIDHF DYIDPVDGVI VVDPDYLKNR KVFAQLATIY

New format:

FT   INIT_MET      1      1       Removed.
FT   CHAIN         2    401       Phosrestin-1.
...
SQ   SEQUENCE   401 AA;  44912 MW;  1212C2422CD35A94 CRC64;
     MVVSVKVFKK ATPNGKVTFY LGRRHFIDHF DYIDPVDGVI VVDPDYLKNR KVFAQLATIY