Programmatic submissions (REST)

EGA supports programmatic submission of sequence and analysis (VCF, phenotype, aligned BAM) metadata. Metadata validation and accessioning take place immediately during submission. A receipt listing the corresponding accessions is provided upon successful submission.

The production service is available here:

https://www.ebi.ac.uk/ena/submit/drop-box/submit/

The test service is available here:

https://www-test.ebi.ac.uk/ena/submit/drop-box/submit/

Submitters are advised to explore the test service before submitting for the first time. The test service is identical to the production service except that all submissions will be discarded on the following day.
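
As a minimal sketch of the advice above, a client script could default to the test service and switch to production only on explicit request. The two URLs are those listed on this page; the function name and the `use_test` flag are illustrative assumptions, not part of the EGA service.

```python
# Endpoints as listed on this page.
PRODUCTION_URL = "https://www.ebi.ac.uk/ena/submit/drop-box/submit/"
TEST_URL = "https://www-test.ebi.ac.uk/ena/submit/drop-box/submit/"


def submission_url(use_test: bool = True) -> str:
    """Return the submission endpoint (hypothetical helper).

    Defaults to the test service, whose submissions are discarded
    the following day, so a first attempt cannot reach production.
    """
    return TEST_URL if use_test else PRODUCTION_URL


if __name__ == "__main__":
    print(submission_url())                # test service
    print(submission_url(use_test=False))  # production service
```

Making the test service the default means production has to be requested deliberately, which suits first-time submitters.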

 

Authentication

Submitters are authenticated each time a submission is made. The submission service uses the HTTPS protocol to encrypt metadata and identify submitters securely, providing a confidential submission environment.

 

Data file upload

Data files must be uploaded to your submission account before they can be validated using this service. More information on uploading your read data is available here.

 

 

The metadata model contains the following XML objects:

Study (EGAS): information about the sequencing study

Samples (EGAN): information about the sequencing samples

Experiment (EGAX): information about the libraries, platform; associated with study, sample(s) and run(s)

Runs (EGAR): contains the run data files; associated with experiments

Analysis (EGAZ): contains the analysis data files; associated with study and sample objects

DAC (EGAC): contains information about the Data Access Committee (DAC)

Policy (EGAP): contains the Data Access Agreement (DAA); associated with DAC

Dataset (EGAD): contains the collection of run and analysis data files that are subject to controlled access; associated with Policy

Submission: information about the submission actions (such as ADD or VALIDATE, together with PROTECT to specify an EGA submission)
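
The Submission object described above can be sketched as a small XML document generated with Python's standard library. This is an illustration under stated assumptions, not the authoritative format. The action names ADD, VALIDATE and PROTECT come from this page, but the element and attribute names (`SUBMISSION`, `ACTIONS`, `ACTION`, `source`, `schema`) and the file names are assumptions that should be checked against the published schema before use.

```python
import xml.etree.ElementTree as ET


def build_submission(files, action="ADD", protect=True):
    """Build a submission XML string (illustrative sketch).

    files:   mapping of object type -> XML file name, e.g.
             {"study": "study.xml"} (hypothetical names).
    action:  "ADD" or "VALIDATE", as named on this page.
    protect: add a PROTECT action to mark an EGA submission.
    """
    if action not in ("ADD", "VALIDATE"):
        raise ValueError(f"unsupported action: {action!r}")

    # Element and attribute names below are assumptions.
    root = ET.Element("SUBMISSION")
    actions = ET.SubElement(root, "ACTIONS")

    # One ACTION wrapper per metadata file.
    for schema, source in files.items():
        wrapper = ET.SubElement(actions, "ACTION")
        ET.SubElement(wrapper, action, {"source": source, "schema": schema})

    if protect:
        wrapper = ET.SubElement(actions, "ACTION")
        ET.SubElement(wrapper, "PROTECT")

    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(build_submission({"study": "study.xml", "sample": "sample.xml"},
                           action="VALIDATE"))
```

Running the sketch with `action="VALIDATE"` first and switching to `"ADD"` only once the files pass mirrors the recommendation to explore the service before a real submission.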

The Submission, Study, Sample, Experiment, Run and Analysis objects share the same schema as those used by the European Nucleotide Archive (ENA) for open and public submissions.  

DAC, Policy and Dataset are objects unique to the EGA; they define and describe the controlled-access requirements of a submission.
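
The accession prefixes listed above lend themselves to a simple lookup table. The sketch below maps each prefix to its object type and flags the three EGA-specific types. The prefixes and type names are taken from this page; the helper names and the example accession are hypothetical, and only the four-letter prefix is inspected because this page does not describe the rest of the accession format.

```python
# Prefix -> object type, as listed in the metadata model above.
ACCESSION_PREFIXES = {
    "EGAS": "Study",
    "EGAN": "Sample",
    "EGAX": "Experiment",
    "EGAR": "Run",
    "EGAZ": "Analysis",
    "EGAC": "DAC",
    "EGAP": "Policy",
    "EGAD": "Dataset",
}

# Object types that exist only in the EGA (not shared with ENA).
EGA_ONLY_TYPES = {"DAC", "Policy", "Dataset"}


def object_type(accession: str) -> str:
    """Return the object type for an accession, based on its prefix only."""
    prefix = accession[:4].upper()
    if prefix not in ACCESSION_PREFIXES:
        raise ValueError(f"unknown accession prefix: {prefix!r}")
    return ACCESSION_PREFIXES[prefix]


def is_ega_specific(accession: str) -> bool:
    """True if the accession belongs to a DAC, Policy or Dataset object."""
    return object_type(accession) in EGA_ONLY_TYPES


if __name__ == "__main__":
    example = "EGAD00000000001"  # hypothetical accession for illustration
    print(object_type(example), is_ega_specific(example))
```

A lookup like this can be handy when sorting the accessions returned in a submission receipt by object type.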

 

Link to the latest XML object schemas (1.5)  
Please note that the EGA uses the XML schemas maintained by the European Nucleotide Archive (ENA).

Further information about working with XML

Further information on preparing your EGA XML files