- Course overview
- Search within this course
- Overview of key IMPC concepts and tools
- Solr query syntax: simplified explanation
- How to use the solr_request function from the impc-api python package
- How to perform a query: q parameter
- Exercise 1: getting familiar with the core
- How to request a limited number of documents: rows parameter
- Exercise 2: requesting three documents
- How to get specific fields: fl parameter
- Exercise 3: selecting specific fields
- Quiz 4: basic Solr parameters
- Filtering data in Solr: narrowing down your results
- How to query a specific field: filter by value
- Exercise 4: filtering by a single field
- How to filter numbers: range search
- Exercise 5: changing the p-value threshold
- How to combine multiple filters: Boolean operators
- Exercise 6: applying multiple filters
- How to exclude data: NOT operator
- Why parentheses are important: combine multiple Boolean operators
- Quiz 5: Boolean operators
- How to handle with null values: exclude empty fields
- Exercise 7: explore null values
- Downloading data: getting large results efficiently
- How to download large dataset effectively: pagination
- How to download the data: batch_solr_request function
- What formats are available for downloading: wt parameter
- Exercise 8: download the data
- What is the difference: JSON vs CSV
- What you need to keep in mind: query responsibly
- Quiz 6: request only necessary data
- Advanced Solr query techniques: faceting and iterating over entities
- Understanding IMPC data: resources and assistance
- Your feedback
Output of the simplest request in your browser
To see the results of the simple request from the previous page, open follow this link. The results will look like this:
{
"responseHeader":{
"status":0,
"QTime":0,
"params":{
"q":"*:*",
"fl":"marker_symbol,effect_size,p_value",
"rows":"3"}},
"response":{"numFound":5197637,"start":0,"docs":[
{
"effect_size":0.0,
"marker_symbol":"Sarnp",
"p_value":1.0},
{
"marker_symbol":"Klhdc2"},
{
"effect_size":0.697501747030049,
"marker_symbol":"Uap1",
"p_value":6.96763811835258E-10}]
}
}
The output is a JSON file. Each object in curly brackets is a key:value pair. We have two sections:
- The
responseHeadersection provides metadata about the query response, such as status, query time, and query parameters. - The
responsesection contains the data retrieved from the database, including the number of documents found.
Instead of constructing the query manually in the browser as we did here, next we will use the Python package impc_api.