Summary
- The EGA is a data repository which provides services for permanent archiving and distribution of personally identifiable genetic and phenotypic data resulting from biomedical research projects.
- Files at the EGA are sorted into datasets. The datasets are defined as collections of related files with a common data access policy. The original submitter, as part of the submission process, generates datasets to which users then apply for access after the data release.
- DAC retains ownership of the dataset/s that it governs. This includes the data files, associated metadata and access decisions for each dataset. The role of the EGA is to act as a guardian of the data, to ensure that files are archived and distributed securely to the users approved by the DAC.
- You can download dataset(s) of interest once your request has been approved by the appropriate Data Access Committee by using the EGA download client – pyEGA3.
- The EGA is committed to its involvement in the work of GA4GH. EGA is currently implementing the Data Use Ontology (DUO) functionality, to enhance data discoverability and streamline the data access procedures.
- You can submit various types of potentially identifiable genetic and phenotypic human data
- Some important EGA video tutorial links can be found below
| Introduction to exploring the EGA | https://embl-ebi.cloud.panopto.eu/Panopto/Pages/Viewer.aspx?id=5e17012d-ff9f-482c-846c-ace701330301 |
| EGA Python download client | https://embl-ebi.cloud.panopto.eu/Panopto/Pages/Viewer.aspx?id=be79bb93-1737-4f95-b80f-ab4300aa6f5a |
| Python download client FAQs | https://ega-archive.org/download/downloader-quickguide-APIv3 |