What data is available through the Portal?
The European COVID-19 Data Platform consists of three connected components:
- SARS-CoV-2 Data Hubs, which organise the flow of SARS-CoV-2 outbreak sequence data and provide comprehensive open data sharing for the European and global research communities. Essential metadata, such as sampling tracking identifiers, sampling time, geographical location, method of sampling, health status of host and sequencing platform/strategy, are captured alongside sequence data. The SARS-CoV-2 Data Hubs also provide systematic data processing and analysis, visualisation and phylogenetic analysis tools.
- Federated European Genome-phenome Archive, which provides secure controlled access sharing of sensitive patient and research subject data sets relating to COVID-19. With technical tools for the deployment of nationally located secure database nodes and appropriate protocols to connect nodes across the federation for search and access requests, the system supports national data management requirements for genomic and clinical data collected from citizens as part of healthcare or biomedical research projects. Access to the datasets is provided to authorised researchers using Data Access Committee-centred processes and secure protocols already deployed for the EGA service and implemented nationally as part of the federated EGA model.
- COVID-19 Data Portal, which brings together COVID-19-related data and scientific literature held in EMBL-EBI’s data resources, including ENA, UniProt, PDBe, EMDB, Expression Atlas and Europe PMC. To ensure availability of the latest data sets, the COVID-19 Data Portal is synchronised with these data resources. The data continue to grow in diversity and volume and include sequences, structures, expression data, compound screens, biochemistries and scientific publications.
In this video Guy Cochrane provides an overview of what kind of data can be viewed using the COVID-19 Data Portal and the tools available.