Data formats available from the browser

The ENA browser supports several different data types.  While all objects are available in HTML and XML format, the other formats differ based on the data type.  Please refer to the table below for a summary of the supported formats for each data type in ENA. The 'Example' column contains example entries retrieved from the ENA Browser in HTML, XML, Fasta, Fastq and flatfile formats. The ENA Browser provides retrieval and visualisation functionality over ENA data and metadata and uses REST URLs to support both interactive and programmatic access. The 'Schema' column points to the XML Schemas that describe the data class specific XML formats. Please note that all XML documents returned by the ENA Browser are included in and validate against the ENA.root.xsd XML Schema.

Datadomain Data object Definition Example XML Schema
Assembly Assembly

A record detailing the construction of reads and sequence contigs into higher order scaffolds and chromosomes.

HTML
XML
ENA.assembly.xsd
Sequence Sequence A record representing an assembled and annotated sequence Fasta
Flatfile
HTML
XML
ENA.embl.xsd
Coding Coding A record representing an annotated coding region derived from assembled sequences Fasta
Flatfile
HTML
XML
ENA.embl.xsd
Non-coding Non-coding A record representing an annotated non-protein-coding region derived from assemble sequences Fasta
Flatfile
HTML
XML
ENA.embl.xsd
Analysis Analysis A record pointing to and describing a set of analysis files.     SRA.analysis.xsd
Analysis-file A record containing secondary analysis results computed from primary sequencing results.   Not available
Run Experiment A record containing information about a next generation sequencing data set, covering for example library and platform information. HTML
XML
SRA.experiment.xsd
Run A record pointing to and describing a 'Run-file' record. HTML
XML
 SRA.run.xsd
Run-file A record containing raw next generation sequence data including, for example, base calls and per-base quality scores. Fastq
CRAM
BAM
Not available
Trace Trace info A record providing sequenced sample, library and machine configuration for capillary sequencing data HTML
XML
Not available
Trace-file A record containing capillary sequence reads data, including base calls and quality scores. Fasta
Fastq
Not available
Sample Sample A Sample contains information about the sample upon which the next-generation sequencing experiments are based. HTML
XML
SRA.sample.xsd
Taxon Taxon Information relating to the organism that served as the source of material sequenced and its classification HTML
XML
ENA.taxonomy.xsd
Study Study Record that serves to unite content otherwise dispersed across ENA, typically into read, assembly, transcriptome and targeted locus studies, etc. HTML
XML
ENA.project.xsd
SRA.study.xsd
Submission Submission A record containing submission and update transaction details for the use of submitters during communication with ENA. HTML
XML
SRA.submission.xsd

Latest ENA news

11 Oct 2017: Read data download issues resolved

Read data download issues previously affecting ftp.sra.ebi.ac.uk and fasp.sra.ebi.ac.uk services now resolved.

06 Oct 2017: ENA read data download issues

Issues with read data download from ftp.sra.ebi.ac.uk and fasp.sra.ebi.ac.uk

04 Oct 2017: ENA Release 133

Release 133 of ENA's assembled/annotated sequences now available