BioStudies Team

The BioStudies database holds descriptions of biological studies, links to data generated in these studies in key community databases at the EMBL-EBI and elsewhere, as well as “orphan data” traditionally provided as supplementary materials. The database can accept data from a wide range of studies described via a simple format, and does not impose minimum requirements outside those agreed by the respective community.

The overall goal of BioStudies is to facilitate transparency and reproducibility of research by aggregating all the outputs of a study (a ‘data package’) in a single place. We are enabling this aggregation across the various stages of research.

  • Ideally scientists consider data management and publishing while the investigation is running, and we are partnering with projects like RISK-HUNT3R to make sure that well-structured data is captured, ready for release upon publication.
  • When data is captured during the manuscript preparation stage, we enable authors to submit supplementary information and cite it in the publication. Specialized community databases should be used when applicable, and BioStudies enable linking to these, as well as submitting orphan data that do not have a dedicated ‘home’ resource.
  • The BioStudies system can be used for an emerging public data infrastructure, as exemplified by the BioImage Archive.
  • We create data packages also after publication, by importing supplementary data and text-mined database links from Europe PMC, and curated data on figures from the SourceData project.
  • BioStudies can also help with publishing data that live in a resource being retired – an example is ArrayExpress.

Please contact us at biostudies@ebi.ac.uk for further information or collaboration ideas.

Data resources