ENA accession numbers

A set of defined rules describes the format of ENA accession numbers. The regular expression for each accession number type is listed below. Note the trailing "\.\d+", which denotes the sequence version for assembled/annotated sequences, protein coding sequences, and genome collections.

Accession number type            Accession number format
Assembled/Annotated sequences    [A-Z]{1}\d{5}\.\d+
Protein coding sequences         [A-Z]{3}\d{5}\.\d+
Traces                           TI\d+
Studies                          (E|D|S)RP\d{6,}
Samples                          ERS\d{6,}
Experiments                      (E|D|S)RX\d{6,}
Runs                             (E|D|S)RR\d{6,}
Analyses                         (E|D|S)RZ\d{6,}
Genome collections               GCA_\d{9}\.\d+
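
As an illustration, the patterns listed above could be used to check which accession type a given string matches. The following is a minimal Python sketch, not an official ENA tool: the dictionary labels and the function name classify_accession are hypothetical, and the patterns are copied unchanged from the list above. It uses a full-string match, so an identifier with extra leading or trailing characters is rejected.

```python
import re

# Patterns copied from the list above; the keys are illustrative labels only.
ACCESSION_PATTERNS = {
    "assembled/annotated sequence": r"[A-Z]{1}\d{5}\.\d+",
    "protein coding sequence": r"[A-Z]{3}\d{5}\.\d+",
    "trace": r"TI\d+",
    "study": r"(E|D|S)RP\d{6,}",
    "sample": r"ERS\d{6,}",
    "experiment": r"(E|D|S)RX\d{6,}",
    "run": r"(E|D|S)RR\d{6,}",
    "analysis": r"(E|D|S)RZ\d{6,}",
    "genome collection": r"GCA_\d{9}\.\d+",
}

# Compile once so repeated lookups do not re-parse the expressions.
_COMPILED = {name: re.compile(p) for name, p in ACCESSION_PATTERNS.items()}


def classify_accession(accession):
    """Return the label of the first pattern that matches the whole string,
    or None if no pattern matches (hypothetical helper)."""
    for name, pattern in _COMPILED.items():
        if pattern.fullmatch(accession):
            return name
    return None


if __name__ == "__main__":
    for acc in ["A12345.1", "ERR123456", "GCA_000001405.15", "ERR12345"]:
        print(acc, "->", classify_accession(acc))
```

Under these patterns, an identifier without a version suffix (for example "A12345") or a run accession with fewer than six digits (for example "ERR12345") matches no type, and the sketch returns None for it.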
