Submitting data to the DGVa


Submission formats

The DGVa accepts submission of processed structural variant data as an Excel spreadsheet and/or tab-delimited text files. Firstly, raw data from the experiments should be submitted to appropriate archives such as ArrayExpress, the European Nucleotide Archive (ENA) or the European Genome-phenome Archive (EGA). The identifiers received from these archives will be used as part of the submission process to the DGVa.

For small studies, it is probably easiest to submit your data using our Excel spreadsheet template. Submission information is grouped into a number of worksheets, with details such as: submitter's contact information, general information about the study, descriptions of samples, the experimental procedures used, details of the variation observed in individual samples ('variant calls') and how these have been merged within a genomic region ('variant regions'). An accompanying notes section in the Excel template provides assistance with the submission process – this is also available to download as a PDF. It is advisable to read this notes in their entirety before embarking on the submission process.  The latest version of the Excel template and notes can be downloaded from the DGVa Submissions webpage.

For larger studies, for example when you have thousands of variants and regions to report, you may wish to substitute any or all of the worksheets for tab-delimited text files. Use the field names on the corresponding worksheet as the header record and give the text file exactly the same name as the Excel worksheet name.

The DGVa has reasonably well-defined criteria for the type of information that can be entered. However, if you would like to add extra information, or need additional guidance, please email the dgva-helpdesk [at] (DGVa helpdesk).  

Making your submission to the DGVa

Email the submission file(s) to the dgva-admin [at] (submissions team). DGVa can also accept submissions via FTP if the files are too large to email – please email the dgva-helpdesk [at] (DGVa helpdesk) to request this and details will be provided.

What happens next?

After you have made your submission to DGVa, the next steps in the process are:

1. Validation

Your submission files are validated to ensure that they provide the required information, in the correct format. You will be notified by email if amendments are requested.

2. Accessioning

After the data have been uploaded to the archive, unique and stable accession numbers will be granted for the study, variant regions, and all sample level variant calls (see DGVa data).

3. Public Release

The study dataset becomes publicly available via a number of sources (see Getting data from the DGVa). Public releases normally occur on the last Thursday of each month.