Online tutorial

Accessing Mouse Phenotypes and Disease Associations with the IMPC Solr API

A complete Python guide

Time to complete:

3 hours

This course includes:

  • Activities
  • Quizzes

Last reviewed:

November 2024


Creative Commons

All materials are free cultural works licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license, except where further licensing details are provided.


The International Mouse Phenotyping Consortium (IMPC) aims to identify the function of every protein-coding gene in the mouse genome. This tutorial walks you through accessing IMPC data programmatically with Python, using the IMPC Solr API.

Who is this course for?

This course is designed for anyone who wants to learn how to access IMPC data using Python. It will give you the skills to navigate and use the data effectively. Basic Python programming skills are required, so we recommend completing an introductory Python course first. If you're new to IMPC or want to explore its resources further, we suggest starting with our course The International Mouse Phenotyping Consortium: Finding phenotypes for your gene of interest.

What will I achieve?

By the end of the course you will be able to:

  • Explain how IMPC data is organised and what kinds of information you can access
  • Use helper functions from the impc_api Python package to request IMPC data through the IMPC Solr API
  • Customise query parameters to retrieve the data you are interested in
  • Download IMPC data efficiently with Python using pagination
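
To give a flavour of the last two outcomes, here is a minimal sketch of how Solr query parameters and pagination fit together. It uses only the Python standard library and makes no network calls. The base URL, the core name and the field names are assumptions for illustration, and the two helper functions are hypothetical (they are not part of the impc_api package); the parameters q, fl, rows, start and wt are standard Solr query parameters.

```python
from urllib.parse import urlencode

# Assumption: base address of the IMPC Solr service. Check the tutorial for the current URL.
BASE_URL = "https://www.ebi.ac.uk/mi/impc/solr"


def build_query_url(core, q="*:*", fields=None, rows=10, start=0):
    """Build a Solr select URL (hypothetical helper, not from impc_api).

    core   -- name of the Solr core to query
    q      -- Solr query string; "*:*" matches every document
    fields -- optional list of field names to return (Solr "fl" parameter)
    rows   -- number of documents per page
    start  -- offset of the first document to return
    """
    params = {"q": q, "rows": rows, "start": start, "wt": "json"}
    if fields:
        params["fl"] = ",".join(fields)
    return f"{BASE_URL}/{core}/select?{urlencode(params)}"


def page_starts(num_found, rows):
    """Return the 'start' offsets needed to page through num_found documents."""
    return list(range(0, num_found, rows))


# Example: plan the requests for 25 matching documents, 10 per page.
# The core and field names below are illustrative assumptions.
urls = [
    build_query_url(
        "genotype-phenotype",
        q="marker_symbol:Cib2",
        fields=["marker_symbol", "mp_term_name"],
        rows=10,
        start=offset,
    )
    for offset in page_starts(25, 10)
]
```

Each URL in the list requests one page of results. In practice you would read the total number of matches from the first response and use it to decide how many further pages to request.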

What resources do I need?

This tutorial includes exercises that use Python to access the IMPC API. You can complete the exercises in either Google Colab or a Jupyter Notebook. Further information about these options is provided later in the tutorial.

DOI: 10.6019/TOL.IMPCSolrAPI-t.2024.00001.1