- Course overview
- Search within this course
- Overview of key IMPC concepts and tools
- Solr query syntax: simplified explanation
- How to use the solr_request function from the impc-api python package
- How to perform a query: q parameter
- Exercise 1: getting familiar with the core
- How to request a limited number of documents: rows parameter
- Exercise 2: requesting three documents
- How to get specific fields: fl parameter
- Exercise 3: selecting specific fields
- Quiz 4: basic Solr parameters
- Filtering data in Solr: narrowing down your results
- How to query a specific field: filter by value
- Exercise 4: filtering by a single field
- How to filter numbers: range search
- Exercise 5: changing the p-value threshold
- How to combine multiple filters: Boolean operators
- Exercise 6: applying multiple filters
- How to exclude data: NOT operator
- Why parentheses are important: combine multiple Boolean operators
- Quiz 5: Boolean operators
- How to handle with null values: exclude empty fields
- Exercise 7: explore null values
- Downloading data: getting large results efficiently
- How to download large dataset effectively: pagination
- How to download the data: batch_solr_request function
- What formats are available for downloading: wt parameter
- Exercise 8: download the data
- What is the difference: JSON vs CSV
- What you need to keep in mind: query responsibly
- Quiz 6: request only necessary data
- Advanced Solr query techniques: faceting and iterating over entities
- Understanding IMPC data: resources and assistance
- Your feedback
Important definitions: query, field, core, document, parameter
- Query: A request for data.
- Core: A “self-contained database” within Solr, each with its own schema, configuration, and data. For IMPC data, several Solr cores are available:
experiment,statistical-result,genotype-phenotype,phenodigm, andimpc_images. A more detailed description of each can be found here. - Document: A single, structured unit of data that Solr stores. It is similar to a row in a database table. Each document consists of multiple fields.
- Field: A specific piece of data or attribute within a document, similar to a column in a database table.
- Parameter: In the context of Solr, a parameter is a setting or input that influences how a query is processed. q and rows are examples of Solr parameters.
Note: The IMPC parameter and Solr parameter have different meanings.
Continue to the next page to test your learning so far.