Reporting standards

Harmonizing data and metadata collection has become essential now that generating data is often easier and more affordable than organizing and storing it.

Compliance of submitted data with the relevant reporting standards promotes:

  • consistent and adequate data description
  • thorough data validation
  • data discoverability
  • data reproducibility
  • data interoperability and usability

ENA/INSDC reporting standards

The European Nucleotide Archive requires, where appropriate, use of the following reporting standards:

  • Feature Table – Description of nucleotide sequence provenance and functional annotation of nucleotide sequence domains
  • Third Party Data – Guidelines for submission, by a third party, of assembly and/or annotation of existing INSDC reads and primary sequences
  • Genome Assembly – Guidelines for submission of genome assemblies

Community-developed reporting standards

The European Nucleotide Archive supports use of the following community-developed reporting standards:

  • BARCODE – Minimum information about a species BARCODE sequence
  • GMI:MDM – Minimal Data for Mapping in relation to the Global Microbial Identifier pathogen tracking initiative
  • Micro B3 – Minimum information about marine microbial sampling
  • MINSEQE – Minimum Information about a high-throughput Nucleotide SeQuencing Experiment
  • MIxS – Minimum Information about any (x) Sequence
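As a rough illustration of how a minimum-information standard supports validation, the sketch below checks a sample metadata record for missing required fields before submission. The field names are MIxS-style terms, but the list of required fields, the function name `missing_fields`, and the sample record are assumptions made for this example; the actual mandatory fields depend on the specific checklist and its version.

```python
from typing import Dict, List

# Hypothetical required-field list for illustration only; the real mandatory
# fields depend on the specific checklist and its version.
REQUIRED_FIELDS = ["collection_date", "geo_loc_name", "lat_lon"]


def missing_fields(sample: Dict[str, str], required: List[str] = REQUIRED_FIELDS) -> List[str]:
    """Return the required field names that are absent or blank in the record."""
    return [name for name in required if not sample.get(name, "").strip()]


# Example record (invented values): lat_lon is present but empty.
sample = {
    "collection_date": "2014-06-16",
    "geo_loc_name": "United Kingdom",
    "lat_lon": "",
}

print(missing_fields(sample))  # ['lat_lon']
```

A check like this only confirms that fields are present; it does not verify that the values follow the formats a given standard prescribes.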

Specialised databases

European Nucleotide Archive submitters may also wish to submit to the following specialised databases after acquiring an INSDC accession number from ENA:

  • IPD-IMGT/HLA – For Human Leukocyte Antigen sequences, overseen by the WHO HLA Nomenclature Committee
  • IPD-MHC – For non-Human Major Histocompatibility Complex sequences, overseen by the Comparative MHC Nomenclature Committee
  • IPD-KIR – For Human Killer-cell Immunoglobulin-like Receptor sequences
