Experiment submissions overview

Annotare webform tool


Read this guide before submitting
Annotare login page

  • For all non-pipeline submissions of microarray or sequencing experiments. Pipeline submitters: please contact us at annotare@ebi.ac.uk and we will advise further
  • Fill in a series of webforms (auto-saved), featuring auto fill-down and copy/paste ("import values") functions
  • Supports up to 1000 assays, no complex design (e.g. one microarray design used, no sample pooling)
  • Upload data files directly within the tool or via FTP
  • Automatic validation prior to submission to speed up downstream processing and loading into ArrayExpress

 

1. What data sets do we accept?
2. Major components of an experiment submission

What data sets do we accept?

ArrayExpress accepts all functional genomics data, e.g. expression profiling, genotyping, chromatin immunoprecipitation, copy-number variation, from microarray and sequencing technologies.

Microarray submissions follow the "Minimum Information About a Microarray Experiment" (MIAME) guidelines, while sequencing submissions follow a similar set of guidelines, "Minimum Information About a Sequencing Experiment" (MINSEQE).

Please try and get your submission in the best possible shape to avoid retrospective fixes post-curation, because such fixes could be tedious and lengthy (especially for sequencing experiments).

Data from human samples and individuals that can potentially lead to the identification of the donors (e.g. genomic DNA sequences) can be submitted to ArrayExpress if the data has been consented for public release. Such approvals typically would be given by the relevant ethics committees and ensuring this is the responsibility of the submitters. Identifiable data approved for controlled access should be submitted to the European Genome-phenome Archive (EGA) (more details here).

Major components of an experiment submission

Following the MIAME/MINSEQE guidelines, we ask submitters to provide:

 

Meta-data: E.g. Background biology of your experiment, aim of the experiment, biological materials/samples used and their characteristics, wet-lab and dry-lab procedures (protocols).

Raw data files: These are unprocessed data files obtained from the microarray scanner (e.g. Affymetrix CEL files, Agilent feature extraction files) or from the sequencing machine (e.g. fastq files).

Processed data files: These are data files generated from the raw data files, often involving e.g. normalisation or background-subtraction. In sequencing experiments, the processing can also involve alignment of sequencing reads to a reference genome, calculation of RPKM/FPKM values, etc.

  • Microarray experiment: Send us processed data in tab-delimited text (*.txt) files (not Excel spreadsheets) as in this this guide.
  • Sequencing experiment: We accept tables/spreadsheets of processed data saved as tab-delimited text (*.txt) format. We also accept bam (alignment) files, see bam file specification for more details.

Top