Projects

There are two types of projects (also referred to as studies) that can be submitted to ENA: sequencing and umbrella.

sequencing project is created to group together sequences originating from a single organisation or from a consortium of coordinated organisations to make up a complete genome or metagenome. The sequencing projects may contain assembled genomic sequences, EST libraries or any other sequences that contribute to the assembly and annotation of the genome or metagenome.  A sequencing project can also group together next generation sequencing data (previously represented by the SRA study). 

An umbrella project is a hierarchical grouping of sequencing projects. Umbrella projects can also be used to group together other umbrella projects.

Common fields for both sequencing and umbrella projects

Project fields, collected at the time of submissions, are described in the following sections. Submission and update of project information is supported through the SRA Webin submission system. For further information, please contact  datasubs@ebi.ac.uk.

Field Description
Accession A unique identifier assigned to each project at the time of submission. Project created by ENA have either PRJEA or PRJEB prefix.
Short name A short descriptive name for the project (optional).
Title A short description of the project akin to an article title.
Abstract A detailed desciption of the project akin to an article abstract.

Sequencing project fields

Field Description
Sample You are asked to choose one of the following options:
  • I have sequenced a single individual or isolated organism.
  • I have sequenced multiple individuals or isolated organisms.
  • I have sequenced a mixed community of organisms.
Common name The common name for the organism (e.g. human).
Scientific name The scientific name for the organism (e.g. homo sapiens).
Taxon identifier The taxon identifier is a species or a strain level taxon identifier from the NCBI taxonomy database (e.g. 9606). For newly sequenced organism please contact datasubs@ebi.ac.uk for the creation of a new taxon identifier.
Strain The strain name (if applicable).
Genome assembly You are asked if the project will describe a genome assembly.
Functional annotation You are asked if the project will contain functional annotation. If yes, then a unique locus tag prefix will be created to be used when submitting EMBL-Bank /locus_tag feature qualifiers.
Locus tag prefix All sequencing projects containing functional annotation will be assigned a unique locus tag prefix. When submitting functional annotation to EMBL-Bank the locus tag prefix must be used in the /locus_tag feature qualifiers. For example, if the locus tag prefix for the project is XXX then the /locus_tag value must start with 'XXX_' followed by a string of letters and numbers uniquely identifying the locus within the context of the project. More information about the usage of /locus_tag in EMBL-Bank sequence records is available here.
Sequenced molecule You are asked if you have sequenced DNA or RNA:
  • I have sequenced DNA.
  • I have sequenced genomic RNA.
  • I have sequenced transcribed RNA.
Genome assembly You are asked if the project will describe a genome assembly.

Additional fields for DNA sequencing projects

Field Description
Enrichment/selection You are asked if you have applied any enrichment or selection:
  • No, I am sequencing the entire genome.
  • Yes, I am sequencing individual chromosomes.
  • Yes, I am sequencing captured exons.
  • Yes, I am sequencing epigenetic markers.
  • Yes, I am sequencing PCR amplicons.
  • Yes, I am doing a random sequencing survey.

Additional fields for transcribed RNA sequencing projects

Field Description
Enrichment/selection You are asked if you have applied any enrichment or selection:
  • Yes, I have created a cDNA library.
  • Yes, I have used a tag approach such as PET or CAGE.
  • No.

Umbrella project fields

Field Description
Common name When applicable it is possible to provide the common name for the organism (e.g. human).
Scientific name When applicable it is possible to provide the scientific name for the organism (e.g. homo sapiens).
Taxon identifier When applicable it is possible to provide the taxon identifier from the NCBI taxonomy database (e.g. 9606).
Strain When applicable it is possible to provide a strain name.