spacer
spacer


The ENA Sequence Version Archive is a repository of all entries which have ever appeared in EMBL-Bank Sequence Database.
You can use this page to browse the archive or use the batch retrieval form. Remove F.A.Q.
 Accession Number or Sequence Version:      case sensitive
Snapshot at  day-month-year (e.g. 30-11-1998 or 30-NOV-1998) Current version

Frequently Asked Questions
How can I access all the versions of particular entry?
How can I see what a particular entry looked like on a specific day?
How can I retrieve a batch of entries?
What options do I have from the results page?
What are the results of comparing versions?
What is a Primary Accession Number?
What is a Secondary Accession Number?
What is a Sequence Version?
What is a Protein Identifier?
What is an Entry Version?
What is a Release?
What is an Issue Date?
How can I access all the versions of particular entry?
Entries can be queried using either Accession Number, Nucleic Sequence Identifier or a Protein Sequence Identifier. On the query page you should type in or cut and paste the Accession Number or Sequence Identifier you wish to query and then press "Go!"
[Top]
 
How can I see what a particular entry looked like on a specific day?
It is possible to add a date to your query in order to retrieve a snapshot of an entry on a specific date. In addition to the Accession Number or Sequence Identifier you should enter the date you wish to see the snap shot of in the "Snapshot at" field e.g. 30-NOV-1998), and then press "Go!" General information.
[Top]
 
How can I retrieve a batch of entries?
You can prepare a text file containing a list of <accession number>.<version> (one for each line) and use the 'batch retrieval form'.
The '.<version>' part can either refer to the entry version or to the sequence version. To indicate which one of the two you are referring to you have to check the appropriate 'Identifier version' button on the form.
It is possible to retrieve the most recent entry for an accession number by omitting the '.<version>' part in the list file.
There is a limit on the number of entries that can be retrieved using the batch faciliy, if you need to have a big number of entries please contact support@ebi.ac.uk
[Top]
 
What options do I have from the results page?
You can do a number of different things from the results page. - you can start a new query - you can choose to list all matches if there are more than 20 - you can choose to view the next or previous 20 matches - you may view an individual flat file by clicking on "view" in the right hand column - you may compare two versions by flagging the desired versions (in the left hand column) and clicking "compare selected" - you may choose to save a number of flat files by selecting the desired versions (in the left hand column) and clicking "save selected". You can save in either EMBL or FASTA format.
[Top]
 
What are the results of comparing versions?
When you compare 2 different versions you will see a single flat file, which uses different coloured lines to represent whether a line has remained unchanged (white), whether it has been deleted (orange), or whether it has been inserted (green). From this page, you can click "back to list" to return to your original results page.
[Top]
 
What is a Primary Accession Number?
Every database entry has a unique primary accession number. It appears on the AC line of an entry. The format is either 1 letter + 5 digits (e.g. X00123), 2 letters + 6 digits (e.g. AJ000123), or 4 letters + 8 digits (for WGS, e.g. BAAC01000001).
[Top]
 
What is a Secondary Accession Number?
Some entries have secondary accession numbers in addition to the primary accession number. The format of the secondary accession numbers is the same as for the primary accession numbers.
[Top]
 
What is a Sequence Version?
The sequence version appeared on the NI line in Release 47 (it did not exist before this time), with the format 'd' or 'e' or 'g' followed by digits (e.g. d12235345). This format was superceded by the SV line in Release 57, with the current format (e.g. AA123456.1). Queries are converted to uppercase by default, so users should tick the "case sensitive" box when querying by NI.
[Top]
 
What is a Protein Identifier?
The protein identifier appeared as a /note feature qualifier in Release 43 (e.g. /note="pid:g2285). It was superceded by a /db_xref feature qualifier in Release 45 (e.g. /db_xref="PID:g2285"). It was superceded by a /protein_id feature qualifier in release 58 (e.g. /protein_id="CAA03857.1"). Queries are converted to uppercase by default, so users should tick the "case sensitive" box when querying by pid or PID.
[Top]
 
What is an Entry Version?
Each time an entry is modified, excluding taxonomic, database cross-reference, and journal name changes, the entry is assigned a new entry version number. The EMBL Sequence Version Archive captures all the entry changes, including changes in the flat file format. As a result multiple entries with the same entry version may coexist in the archive.
[Top]
 
What is a Release?
The release refers to the quarterly EMBL release in which a flat file appeared, or was expected to appear.
[Top]
 
What is an Issue Date?
The issue data is the date when the entry was made available to the public as part of daily update, or release.
[Top]
 
spacer
spacer