What is the EGA?


The European Genome–phenome Archive (EGA) is a permanent repository for all types of potentially identifiable genetic and phenotypic data from biomedical research projects. 

Why do we need the EGA?

Participants in medical or genetic research projects have typically provided consent for their data to be used in research but not for open public distribution. These data require a secure archiving, processing and disseminating service that respects the original informed consent agreements. The EGA was created as a service to make sure that all such data can be made available for the researchers who have been granted access to the data in a secure and controlled way.

About the EGA

The resource accepts raw data from sequencing, genotyping, transcriptome or epigenetics experiments using next-generation sequencing platforms or array-based technologies. The EGA can also be used to archive any processed data, such as the locations of individual variations (e.g. SNPs) from the raw data or summary statistics from a particular project. The samples can be associated with phenotype data that have been consented for use in research. The EGA supports pre-publication data release in accordance with the Toronto agreement.

The EGA can integrate the data with other available EMBL-EBI resources, for example by providing full genomic annotation via Ensembl for those variants that show significant association with the studied phenotype, or links to ArrayExpress for accessing expression data deposited from the same cohort members. The data can also be integrated across individual studies. EGA data, together with data from our partner Data Access Committees (DACs), are made available to users in the most widely used formats.

The studies in the EGA include summary statistics, dense genotype experiments between cases and controls, population based studies or resequencing and RNA-seq data from various cancer genomes.