EBI Search Advanced Documentation
Here are the answers to some more advanced questions you may have about using the EBI search, or technical questions about its implementation.
The search box supports an advanced query syntax to allow you to construct specific queries. By default, search terms separated by spaces are interpreted as a boolean "AND" operation, and specifying "AND" between terms has the same effect:
Specifying "OR" between terms locates results containing either term:
Specifying "NOT", or prepending a minus (-) character to a term, searches for records without that term. By contrast, prepending a plus (+) character requires the term to be present:
Wildcards can be used instead of alphanumeric characters to expand the matches for a search term. The "*" character matches any number of characters, whereas the "?" character matches a single character:
Enclosing a series of words in double quotes searches for an exact phrase. Note that boolean operators and wildcards have no effect in a phrase:
You may use parentheses to control the combination of multiple AND/OR/NOT operators:
It is also possible to perform database-specific searches, and search individual fields of a database's records. See the advanced search section for more information.
The "advanced search" link adjacent to the main search box provides an interface to easily specify complex queries. There are separate input fields for:
You may also restrict your search to a specific database by clicking the "domain-specific search" link. This allows you to choose a domain (a database, or categories of databases), to restrict the search to. After selecting the domain, you can add your search terms as normal.
If you are searching a single database, you can further refine the search by selecting only specific fields from that database to search. For example, if searching PDBe you can choose to search only the "authors" field. Selecting multiple fields will look for your search terms in any of the chosen fields.
Similar to the "field search", you can also search a single database for entries that have cross-references to record identifiers in a second database. For example, selecting the "UniProt" cross-reference in the "Ensembl Gene" domain allows you to search for Ensembl genes which have a cross reference to a specific UniProt identifier.
The search engine can be accessed over the web or programmatically using a SOAP Web Services interface. This allows its search and retrieval capabilities to be exploited in workflows and analytical pipe-lines. See the Search Web Services API.
The above SOAP API does not include the gene and protein summaries. However the data for these are available as a series of REST XML web services in the form of Distributed Annotation System (DAS) sources. These can be accessed as follows:
DAS sources for the gene section
DAS source for the expression section
DAS sources for the protein section
DAS sources for the protein structure section
DAS sources for the literature section