ENA accession numbers

There are a set of defined rules that describe the format of ENA accession numbers.  The regular expressions for each record accession number are listed below. Please note the "\.\d+" at the end denoting sequence versions for the assembled/annotated and protein coding sequences, and genome collections.

Accession number type Accession number format
Asssembled/Annotated sequences [A-Z]{1}\d{5}\.\d+
[A-Z]{2}\d{6}\.\d+
[A-Z]{4}S?\d{8,9}\.\d+
Protein coding sequences [A-Z]{3}\d{5}\.\d+
Traces TI\d+
Studies (E|D|S)RP\d{6,}
PRJ(E|D|N)\d+
Samples ERS\d{6,}
SAM(E|D|N)[A-Z]?\d+
Experiments (E|D|S)RX\d{6,}
Runs (E|D|S)RR\d{6,}
Analyses (E|D|S)RZ\d{6,}
Genome collections GCA_\d{9}\.\d+

Latest ENA news

12 Jul 2017: Submission service maintenance - 14/7/17 to 17/7/17

Webin submission services will not be available between Friday 14/7...

07 Jul 2017: Update to Aspera server

EBI has built a new Aspera server on up-dated hardware with the latest Aspera version and configuration. This should improve...

06 Jul 2017: ENA Release 132

Release 132 of ENA's assembled/annotated sequences now available

30 Jun 2017: Taxon support for sequence, WGS and assembly in ENA Browser Tools

You can now download sequence, WGS and assembly data by tax ID using ENA Browser Tools