Submitting environmental sequences

This covers the submission of data derived from the sequencing of DNA or RNA from organisms that have not been isolated and is typically focused on entire microbial communities, size fractionated organisms, enrichments cultures and flow sorted microbes. Both amplicon (metabarcoding, diversity survey) and shotgun (metagenomics/metatranscriptomics) based methods are included here.

Minimally, ENA aims to capture and present both raw data and identifications (taxonomic or functional) from environmental sequencing approaches. Additionally, intermediate information, such as mappings between raw reads and identifications and cluster information can also be captured.

Amplicon-based

There are currently two different Webin submission systems that both should be used for submitting this type of data.  The first should be used for registering samples and submitting raw sequencing reads, the latter is used to submit the assembled and annotated sequences.

Submitting reads and samples

The read domain Webin submission tool (see here) should be used to submit both raw sequencing reads and samples. Minimal sample description should be provided in this step as more detailed MIMARKS compliant information will be provided in the next step while submitting the annotated sequences. Take note of the sample accession as this will be needed during the sequence submission.

Shotgun-based

The ENA archives Next Generation Sequencing data from metagenomics and metatranscriptomics studies into the read domain. Comprehensive instructions on read data submission can be found here. These guidelines include both programmatic and interactive submissions.

Submitting metagenome assemblies

Metagenome assemblies (contigs, scaffolds and chromosomes) shall be submitted via the ENA (Meta)Genome Assembly Pipeline. A detailed guide on how to submit genomic and metagenomic assemblies is available here.

Sample contextual data

For all studies, but especially metagenomics and metatranscriptomics studies, it is essential that the read data files, study and sequencing experiment description are accompanied by minimal sample metadata information in order to ensure that the data originating from the study are reproducible, meaningful and can be sufficiently analysed.

Minimal reporting requirements for samples of metagenomics and metatrancriptomics studies are addressed in the MIxS data standard described here.

Metagenomic data analysis

Sequencing experiments correctly annotated as metagenomic and containing adequate sample contextual data are eligible for analysis by the EBI Metagenomics services. For further information please contact the EBI metagenomics team directly.

Latest ENA news

23 Jun 2017: New tools to download data from ENA

Introducing two new tools to make retrieving data from ENA much easier: enaBrowserTools and ENA FTP Downloader.

21 Jun 2017: Guide to metagenomics data standards and best practice published

Our latest publication on data standards and best practices in metagenomics is available at https://t.co/PWjYHeWovk and includes a call for the archiving of analysis results data sets. More...

27 Apr 2017: New ENA discovery API

ENA has launched a new API to support programmatic search across all data types: https://www.ebi.ac.uk/ena/portal/api

03 Apr 2017: ENA Release 131

Release 131 of ENA's assembled/annotated sequences now available