- Course overview
- Search within this course
- Overview of key IMPC concepts and tools
- Solr query syntax: simplified explanation
- How to use the solr_request function from the impc-api python package
- How to perform a query: q parameter
- Exercise 1: getting familiar with the core
- How to request a limited number of documents: rows parameter
- Exercise 2: requesting three documents
- How to get specific fields: fl parameter
- Exercise 3: selecting specific fields
- Quiz 4: basic Solr parameters
- Filtering data in Solr: narrowing down your results
- How to query a specific field: filter by value
- Exercise 4: filtering by a single field
- How to filter numbers: range search
- Exercise 5: changing the p-value threshold
- How to combine multiple filters: Boolean operators
- Exercise 6: applying multiple filters
- How to exclude data: NOT operator
- Why parentheses are important: combine multiple Boolean operators
- Quiz 5: Boolean operators
- How to handle with null values: exclude empty fields
- Exercise 7: explore null values
- Downloading data: getting large results efficiently
- How to download large dataset effectively: pagination
- How to download the data: batch_solr_request function
- What formats are available for downloading: wt parameter
- Exercise 8: download the data
- What is the difference: JSON vs CSV
- What you need to keep in mind: query responsibly
- Quiz 6: request only necessary data
- Advanced Solr query techniques: faceting and iterating over entities
- Understanding IMPC data: resources and assistance
- Your feedback
A Python module to access IMPC data: installation and available functions
Instead of using direct requests in the browser, we will use the Python package impc_api to access IMPC data. impc_api provides several helper functions that build upon the IMPC Solr API.
To install the package, simply run the following command:pip install impc_api
You can import two functions via the command:from impc_api import solr_request, batch_solr_request
- The
solr_requestfunction is designed for small, simple requests that allow you to do prototyping. - The
batch_solr_requestfunction is made for large requests and downloading data.
To get help, add a question mark before the function name:? solr_request
We will discuss both functions in more detail later during the tutorial.