Metadata model

The goal of this document is to describe the metadata model and to give sufficient information to submitters to be able to use the objects required for Webin submissions. 

If you have any questions please contact datasubs@ebi.ac.uk.

 

The metadata model contains the following objects:

Object Schema Description
Submission SRA.Submission.xsd A submission contains submission actions to be performed by the archive. A submission can add more objects to the archive, update already submitted objects or make objects publicly available.
Study SRA.study.xsd

A study groups together data submitted to the archive. Please use the study accession number when citing data submitted into ENA. All associated data and other objects are made public when the study release date expires.

Sample SRA.sample.xsd A sample contains information about the sequenced samples. Samples are associated with checklists, which define the attributes used to annotate the samples, and experiments or analysis objects.
Experiment SRA.experiment.xsd An experiment contains information about the sequencing experiments including library and instrument detail.
Run SRA.run.xsd Runs are part of experiments and contain sequencing reads submitted in data files (e.g. BAM or CRAM). Each run can contain all or part of the results for a particular experiment.
Analysis (SEQUENCE_ASSEMBLY) SRA.analysis.xsd

An analysis contains secondary analysis results computed from the primary equencing reads. There are four types of analyses.

This type of analysis is used for genome assembly submissions.

Analysis (REFERENCE_ALIGNMENT)

SRA.analysis.xsd This type of analysis is used for read re-alignment submissions.

Analysis (SEQUENCE_VARIATION)

SRA.analysis.xsd This type of analysis is used for sequence variation submissions.
Analysis (SEQUENCE_ANNOTATION) SRA.analysis.xsd This type of analysis is used for sequence annotation submissions.
EGA DAC EGA.dac.xsd An European Genome-phenome Archive (EGA) data access committee (DAC). Required for authorized access submissions.
EGA Policy EGA.policy.xsd An European Genome-phenome Archive (EGA) data access policy. Required for authorized access submissions.
EGA Dataset EGA.dataset.xsd An European Genome-phenome Archive (EGA) data set. Required for authorized access submissions.

 

The relationships between these objects is described in the picture below:

Webin metadata model

Latest ENA news

19 Jan 2018: Forthcoming changes to WGS and TSA sequences

ENA is making changes to provision of WGS and TSA sequences

05 Jan 2018: ENA release 134

Release 134 of ENA's assembled/annotated sequences is now available

21 Dec 2017: ENA services over the holiday period

Between Friday 22nd December and Tuesday 2nd January ENA services such as submissions and retrieval...

21 Dec 2017: ENA release 134 expected early January

The last release of assembled and annotated sequences for 2017 (134) has been particularly...