0%

Data downloads

In some instances, instead of looking for data associated with a specific gene or phenotype, you may want to download all available data for all the mouse lines the IMPC has phenotyped or for a selection of genes. There are four ways to do this:

  1. Directly from the website: For the data associated with a specific gene or phenotype, look for the download buttons in gene and phenotype pages.
  2. The FTP site: You can download all data via the FTP site; please check the README files in the FTP directory and our non-programmatic data access documentation for more information on this.
  3. Batch Query Service: To get significant phenotype data for a selection of genes you can use the Batch Query functionality in the IMPC website.
  4. Programmatic access via the API: To get specific data for a selection of genes, please refer to our programmatic data access documentation for more information on how to do this. The Python package impc_api provides several helper functions that build upon the IMPC Solr API. For an interactive introduction to using Python to access the API, take a look at the Accessing Mouse Phenotypes and Disease Associations with the IMPC Solr API course. This course teaches you how to extract information from the IMPC Solr endpoints using the impc_api package that you can install with pip and provides practical examples that you can try out in Google Colab or your own Jupyter notebook.
Figure 30 There are various ways to access the IMPC Data; multi gene and bulk downloads are available from the “Accessing the Data” section from the menu “Data” in the IMPC Homepage. For specific gene or phenotypes, please look for the Download buttons in the gene and phenotype pages.