
RNA-seq and ChIP-seq data

1. Types of data that can be submitted
2. How to submit your data
3. Which data files to submit
4. Example RNA-seq/ChIP-seq submission spreadsheets and experiments in ArrayExpress

 

1. Types of data that can be submitted

ArrayExpress accepts submissions of non-human RNA-seq and ChIP-seq data and of non-identifiable human RNA-seq and ChIP-seq data. To submit to ArrayExpress, fill in a simple spreadsheet and transfer your data files to us. We will then transfer the raw data files to the European Nucleotide Archive (ENA) for you.

If you have potentially identifiable human sequencing data, you need to submit it to the European Genome-phenome Archive (EGA), not to ArrayExpress. The EGA will supply you with a submission template and store the identifiable human data securely. They will then pass the non-identifiable data to us, as shown in the diagram below.

 

Diagram: submission routes for different sequencing data types to ArrayExpress or the European Genome-phenome Archive (EGA).


 

2. How to submit your data

To submit data to us, fill out a simple spreadsheet with sample and protocol information, then transfer the trace data files for each sample, along with any processed data files, through our spreadsheet submission system web page.

The spreadsheet format we use is called MAGE-TAB.

To create a spreadsheet template based on some general information about your experiment and then submit it to ArrayExpress:

  • Log in to the spreadsheet submission system
  • Create an 'experiment'
  • Tick the "UHTS experiment" option and select terms describing your experiment
  • Click "Update" and then "Generate template"
  • Download the spreadsheet template and fill it in
  • Upload the completed spreadsheet with your data files then click "Submit"
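
Before uploading, it can help to confirm that every data file named in the completed spreadsheet is actually among the files you are about to send. The following is a minimal sketch, assuming the spreadsheet is saved as tab-delimited text with a single header row. The function name and the column headers ("Sample Name", "Data File") are hypothetical placeholders, not the headers of the official template, so substitute the ones in your own spreadsheet.

```python
import csv
import io


def missing_data_files(spreadsheet_text, file_column, available_files):
    """Return file names listed in the spreadsheet but absent from available_files.

    spreadsheet_text: contents of a tab-delimited sheet with one header row.
    file_column: header of the column naming each data file (hypothetical here).
    available_files: names of the files you are about to upload.
    """
    reader = csv.DictReader(io.StringIO(spreadsheet_text), delimiter="\t")
    available = set(available_files)
    missing = []
    for row in reader:
        name = (row.get(file_column) or "").strip()
        # Report each missing file once, in the order it first appears.
        if name and name not in available and name not in missing:
            missing.append(name)
    return missing


# Illustrative rows only; these headers are placeholders, not the official template.
example_sheet = (
    "Sample Name\tData File\n"
    "sample_1\tlane1.srf\n"
    "sample_2\tlane2.srf\n"
)
print(missing_data_files(example_sheet, "Data File", ["lane1.srf"]))
```

An empty list means every file referenced in the spreadsheet is present in the set you plan to upload.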

Alternatively, fill out the basic spreadsheet template for a sequencing experiment (sequencing template) and upload it through the spreadsheet submission system as described above.

We can also help you to create a template spreadsheet and answer any questions you may have about filling it out. Please email us at miamexpress@ebi.ac.uk.

If you have both sequencing and microarray-based data, please create a spreadsheet for each and submit them as separate 'experiments', even if they are related, because our processing of the two types of data is slightly different.

If you have a large amount of data to submit, please transfer your files to us by FTP instead of uploading them through the spreadsheet submission system web page. Please email us to tell us you are doing this so that we can identify your files on the FTP site.
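
For a large batch, the FTP transfer can be scripted. The sketch below uses Python's standard ftplib and is only an illustration: the helper name upload_files is hypothetical, and the host, user name and password in the usage comment are placeholders, because the actual FTP details are not given here. The function returns the uploaded file names so that you can list them in your email to us.

```python
import os


def upload_files(ftp, paths):
    """Upload each local file in binary mode over an already open FTP connection.

    ftp: an object providing ftplib.FTP's storbinary(cmd, fp) method.
    Returns the remote file names in upload order.
    """
    uploaded = []
    for path in paths:
        remote_name = os.path.basename(path)
        with open(path, "rb") as handle:
            # Binary mode matters: trace files must arrive byte-for-byte intact.
            ftp.storbinary("STOR " + remote_name, handle)
        uploaded.append(remote_name)
    return uploaded


# Usage sketch (host and credentials are placeholders, not real values):
# from ftplib import FTP
# with FTP("ftp.example.org") as ftp:
#     ftp.login("username", "password")
#     print(upload_files(ftp, ["lane1.srf", "lane2.srf"]))
```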


 

3. Which data files to submit

Submit the trace data files and any processed data files that you have, for example files in which the expression values are linked to genome coordinates.

The trace data files that you submit to ArrayExpress will be stored in the European Nucleotide Archive (ENA). The following trace data file formats are accepted:

Technology | Accepted file type | Instructions
Illumina Solexa | .srf | Please download the Staden io_lib package and use the solexa2srf utility to convert run files to the SRF format. One SRF file should be generated for each lane. Please do not compress the SRF files as the format is nearly optimal in terms of compression.
SOLID | .srf | Please download the SOLID System SRF Conversion Tool (solid2srf) and use it to convert the files to the SRF format. Please do not compress the SRF files as the format is nearly optimal in terms of compression.
454 | .sff | Please submit SFF files so that each file can be associated with a single sample.

FASTQ format files are also accepted, but SRF format files are preferred.
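
The format rules above can be checked locally before you transfer anything. This is a minimal sketch, not an official validator: the function name, the status strings, the FASTQ extensions (.fastq, .fq) and the list of compression suffixes are all assumptions made for illustration. Only the technology-to-format mapping comes from the table.

```python
import os

# Accepted trace formats per technology, taken from the table above.
ACCEPTED = {
    "Illumina Solexa": {".srf"},
    "SOLID": {".srf"},
    "454": {".sff"},
}
FASTQ_EXTENSIONS = {".fastq", ".fq"}       # assumed spellings
COMPRESSED_SUFFIXES = {".gz", ".bz2", ".zip"}  # assumed list


def check_trace_file(technology, filename):
    """Classify a trace file name against the accepted formats.

    Returns one of: "ok", "accepted (listed format preferred)",
    "compressed SRF (please submit uncompressed)", "unrecognised".
    """
    stem, ext = os.path.splitext(filename.lower())
    # SRF files should not be compressed, so look under a compression suffix.
    if ext in COMPRESSED_SUFFIXES and os.path.splitext(stem)[1] == ".srf":
        return "compressed SRF (please submit uncompressed)"
    if ext in ACCEPTED.get(technology, set()):
        return "ok"
    if ext in FASTQ_EXTENSIONS:
        return "accepted (listed format preferred)"
    # Anything else is worth a question to the curators before submitting.
    return "unrecognised"


for tech, name in [
    ("Illumina Solexa", "lane1.srf"),
    ("Illumina Solexa", "lane2.srf.gz"),
    ("454", "sample1.fastq"),
]:
    print(tech, name, "->", check_trace_file(tech, name))
```

Files reported as "unrecognised" are not necessarily wrong; the sketch simply does not know about them.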

These are developing technologies, so these recommendations may change. Please contact us if you have any questions or need further information on submitting RNA-seq and ChIP-seq data.


 

4. Example RNA-seq/ChIP-seq submission spreadsheets and experiments in ArrayExpress

RNA-seq example

Illumina Solexa RNA-Seq experiment - E-MTAB-197

ChIP-seq example

Illumina Solexa ChIP-Seq experiment - E-MTAB-115

 

If you have any questions about submitting RNA-seq or ChIP-seq data, or would like help creating a spreadsheet, please contact us at miamexpress@ebi.ac.uk.

