
Searching ArrayExpress

1. Searching the Experiment Archive
  1.1 Accession and keyword searches
  1.2 Advanced searches
    1.2.1 Filter experiments to show data directly submitted to ArrayExpress only (not GEO-imported data)
    1.2.2 Filter experiments by species, array design, molecule or technology
    1.2.3 Combining search terms with AND, OR and NOT
    1.2.4 Specifying fields for searches
    1.2.5 Filtering experiments by counts of assays, samples, experimental factors etc.
  1.3 Login to view private data
  1.4 RSS feed
  1.5 Downloading data/FTP archives and programmatic access
    1.5.1 Files available from the FTP site
    1.5.2 MAGE-TAB format description
    1.5.3 Programmatic access
  1.6 Understanding archive search results
 
2. Searching the Gene Expression Atlas
  2.1 Atlas searches
   

 

 

1. Searching the Experiment Archive


1.1. Accession and keyword searches

  1. enter an experiment accession number or keyword (e.g. RNAi, breast cancer) in the query box on the left-hand panel on the ArrayExpress home page http://www.ebi.ac.uk/arrayexpress, or in the query box on a results page

Archive browse experiments image

Archive query box image

  1. all experiments where your term is found in any of these fields will be returned:
    1. ArrayExpress accession number e.g. E-MEXP-568
    2. secondary accession numbers e.g. GEO series accession GSE5389, ENA study accession number ERP000054
    3. experiment name
    4. submitter's experiment description
    5. sample and experimental factor attribute classifiers and values, including species (e.g. GeneticModification, Mus musculus, DREB2C over-expression)
    6. publication title, authors and journal name, PubMed ID
    7. array design name and accession
  1. synonyms for terms are always included in searches e.g. 'human' and 'Homo sapiens'
  2. a drop down menu will show matching terms in the Experimental Factor Ontology (marked EFO) or terms that exist in any record. The Experimental Factor Ontology is an application-focused ontology modelling the experimental factors in ArrayExpress. The Experimental Factor Ontology expansion affects values that are experiment types, sample attributes, experimental factor values and species. In the search results exact matches are highlighted yellow, synonyms green and child terms pink. See the search results help page here: Understanding archive browse/search results.
Image of archive query expansion

 

  1. Use * as a wildcard for 0 or more characters and ? for a single character, e.g. embryo* will retrieve results with matches to embryo and embryonic, and te?t will search for test and text. Wildcards will not work within phrases (see below).
  2. Put quotes around phrases where you want to search for more than one word together, e.g. "bone marrow".
  3. US spelling conventions are used, e.g. leukemia not leukaemia, although common terms are searched for in both UK and US spellings.
  4. ArrayExpress uses Latin names for species, e.g. Homo sapiens. For some species a search for the common name will return results, but search for the Latin name to be sure you find all relevant experiments.
  5. Non-standard character sets are not supported, e.g. Greek symbols.
  6. To browse all experiments in ArrayExpress click on the 'Browse experiments' link.
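
The wildcard rules above can be tried out locally. The following is a minimal sketch that uses Python's standard fnmatch module to mimic the * and ? behaviour described here; it is only an illustration of the matching rules and not the ArrayExpress search implementation. The helper name matches and the word lists are made up for this example.

```python
from fnmatch import fnmatchcase


def matches(pattern: str, word: str) -> bool:
    """Return True if word fits pattern, where * stands for zero or more
    characters and ? for exactly one character (case-insensitive here)."""
    return fnmatchcase(word.lower(), pattern.lower())


# Hypothetical word lists, chosen only to show which terms each pattern covers.
examples = [
    ("embryo*", ["embryo", "embryonic", "embryos", "zygote"]),
    ("te?t", ["test", "text", "tet", "teast"]),
]

for pattern, words in examples:
    hits = [w for w in words if matches(pattern, w)]
    print(pattern, "->", hits)
# embryo* -> ['embryo', 'embryonic', 'embryos']
# te?t -> ['test', 'text']
```

Note that fnmatch also gives square brackets a special meaning, so this sketch is only suitable for patterns built from letters, * and ?.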

The search results page is described here: Understanding archive browse/search results.


1.2. Advanced searches

1.2.1 Filter experiments to show data directly submitted to ArrayExpress only (not GEO-imported data)

We import data from the Gene Expression Omnibus (GEO, www.ncbi.nlm.nih.gov/geo/). To limit the search to experiments submitted directly to ArrayExpress, and not imported from GEO, check the box under the search box. For more information about how we import data from GEO, see the GEO data help page.

 

image of text box for AE data only


1.2.2 Filter experiments by species, array design, molecule or technology

Experiments can be filtered by species, array design, molecule (DNA, RNA, metabolite, protein) or technology (array, high-throughput sequencing, mass spectrometry) using the drop down menus in the centre of the top search option bar. After selecting a filter, click on the 'Query' box on the right hand side to filter experiments. To remove a filter, either re-select the top option from the list, or click on the '[reset]' link, and then click on the 'Query' box again to requery ArrayExpress.

 

Filtering options


1.2.3. Combining search terms

Enter two or more keywords in the search box with the operators AND, OR or NOT. AND is the default operator; a search for 'prostate breast' will return hits with a match to 'prostate' AND 'breast'.

Search terms of more than one word must be entered inside quotes, otherwise only the first word will be searched for. For example, transcription AND Rattus norvegicus will effectively be a search for transcription AND Rattus.

If a field is not specified (see below) then the term is searched for in any of the experiment fields (experiment description, sample annotation, citation etc.).

 

Operator | Searches | Example
AND | Experiments matching all of the terms. This is the default operator. | AND query
OR | Experiments with either term. | OR query
NOT | Experiments without a term. | NOT query
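
The quoting rule and the operators can be combined mechanically. Below is a minimal sketch of helpers that build a query string for pasting into the search box; the function names quote_term, combine and exclude are hypothetical and not part of any ArrayExpress tool. The exclusion helper writes AND NOT, mirroring the example queries given in section 1.2.5.

```python
def quote_term(term: str) -> str:
    """Wrap multi-word phrases in double quotes; leave single words unchanged."""
    term = term.strip()
    already_quoted = term.startswith('"') and term.endswith('"')
    if " " in term and not already_quoted:
        return f'"{term}"'
    return term


def combine(terms, operator: str = "AND") -> str:
    """Join search terms with AND or OR, quoting phrases where needed."""
    if operator not in ("AND", "OR"):
        raise ValueError("operator must be 'AND' or 'OR'")
    return f" {operator} ".join(quote_term(t) for t in terms)


def exclude(query: str, term: str) -> str:
    """Append a term that must not appear in the matching experiments."""
    return f"{query} AND NOT {quote_term(term)}"


print(combine(["transcription", "Rattus norvegicus"]))
# transcription AND "Rattus norvegicus"
print(combine(["prostate", "breast"], operator="OR"))
# prostate OR breast
print(exclude("prostate", "breast"))
# prostate AND NOT breast
```

Quoting the species name avoids the problem described above, where an unquoted Rattus norvegicus would be searched as Rattus only.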

 


 

1.2.4. Specifying fields for searches

Particular fields can also be specified for searching, in the format fieldname:value. Again, phrases of more than one word must be entered in quotes, otherwise only the first word will be searched for. The fields that can be searched are shown in the table below.

 

Field name | Searches | Example
accession | Experiment primary or secondary accession | accession query
array | Array design accession or name | array query
ef | Experimental factor, the name of the main variables in an experiment. | experimental factor query
efv | Experimental factor value. Has EFO expansion. | experimental factor value query
expdesign | Experiment design type | experiment design type query
exptype | Experiment type. Has EFO expansion. | experiment type category
gxa | Presence in the Gene Expression Atlas. Only value is gxa:true. | atlas query
pmid | PubMed identifier | pubmed query
sa | Sample attribute values. Has EFO expansion. | sample attribute query
species | Species of the samples. Has EFO expansion. | species query
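
As an illustration of the fieldname:value format, here is a minimal sketch that builds field-specific query terms and rejects field names that are not in the table above. The helper field_query and the constant SEARCH_FIELDS are hypothetical names introduced for this example only.

```python
# Field names taken from the table above.
SEARCH_FIELDS = {
    "accession", "array", "ef", "efv", "expdesign",
    "exptype", "gxa", "pmid", "sa", "species",
}


def field_query(field: str, value) -> str:
    """Build a fieldname:value term, quoting values of more than one word."""
    if field not in SEARCH_FIELDS:
        raise ValueError(f"unknown search field: {field}")
    value = str(value).strip()
    if " " in value:
        value = f'"{value}"'
    return f"{field}:{value}"


print(field_query("species", "Homo sapiens"))  # species:"Homo sapiens"
print(field_query("accession", "E-MEXP-568"))  # accession:E-MEXP-568
print(field_query("gxa", "true"))              # gxa:true
```

Terms built this way can be joined with AND, OR and NOT in the same way as plain keywords.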


 

1.2.5. Filtering experiments by counts of a particular attribute

Experiments fulfilling certain count criteria can also be searched for, e.g. those having more than 10 assays (hybridizations). These searches use the following syntax:

 

Filter | What is filtered
assaycount:[x TO y] | filter on the number of assays, where x <= y and both values are between 0 and 99,999 (inclusive). To exclude the values given use curly brackets, e.g. assaycount:{1 TO 5} will find experiments with 2-4 assays. Single numbers may also be given, e.g. assaycount:10 will find experiments with 10 assays.
efcount:[x TO y] | filter on the number of experimental factors
samplecount:[x TO y] | filter on the number of samples
sacount:[x TO y] | filter on the number of sample attribute categories
rawcount:[x TO y] | filter on the number of raw files
fgemcount:[x TO y] | filter on the number of final gene expression matrix (processed data) files
miamescore:[x TO y] | filter on the MIAME compliance score (maximum score is 5)
date:yyyy-mm-dd | filter by release date

  • date:2009-12-01 - will search for experiments released on 1st of Dec 2009
  • date:2009* - will search for experiments released in 2009
  • date:[2008-01-01 2008-05-31] - will search for experiments released between 1st of Jan and end of May 2008

Examples

Search term | What is searched
leukemia AND species:"homo sapiens" AND exptype:"transcription profiling" AND assaycount:[10 TO 99999] | Transcription profiling experiments that mention the word 'leukemia' in any field, use human samples and have at least 10 assays
species:"Arabidopsis thaliana" AND NOT array:Affymetrix* AND fgemcount:[1 TO 99999] | Arabidopsis experiments that are not on an Affymetrix array, but that have processed data files
species:Saccharomyces* AND date:[2009-06-01 2010*] | Yeast experiments released between June 2009 and 2010
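
The bracket rules for count filters can be sketched in a few lines. The example below builds count filter terms and lists which counts a range covers; count_filter and matching_counts are hypothetical helper names. It assumes that the 0 to 99,999 limits stated for assaycount apply to the other count fields as well, which the text above does not confirm.

```python
# Count fields from the filter table above.
COUNT_FIELDS = {
    "assaycount", "efcount", "samplecount", "sacount",
    "rawcount", "fgemcount", "miamescore",
}
MAX_COUNT = 99_999  # assumption: the assaycount upper limit applies to every field


def count_filter(field: str, low: int, high=None, inclusive: bool = True) -> str:
    """Build field:[low TO high], field:{low TO high} or field:low."""
    if field not in COUNT_FIELDS:
        raise ValueError(f"unknown count field: {field}")
    if high is None:
        if not 0 <= low <= MAX_COUNT:
            raise ValueError("value must be between 0 and 99,999")
        return f"{field}:{low}"
    if not 0 <= low <= high <= MAX_COUNT:
        raise ValueError("need 0 <= low <= high <= 99,999")
    left, right = ("[", "]") if inclusive else ("{", "}")
    return f"{field}:{left}{low} TO {high}{right}"


def matching_counts(low: int, high: int, inclusive: bool = True) -> list:
    """List the counts a range covers, to show the square vs curly bracket rule."""
    return list(range(low, high + 1)) if inclusive else list(range(low + 1, high))


print(count_filter("assaycount", 10, 99999))              # assaycount:[10 TO 99999]
print(count_filter("assaycount", 1, 5, inclusive=False))  # assaycount:{1 TO 5}
print(matching_counts(1, 5, inclusive=False))             # [2, 3, 4]
print(count_filter("assaycount", 10))                     # assaycount:10
```

The first printed term is the assay count condition used in the leukemia example above.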

1.3 Login to view private data

Data can be kept private in ArrayExpress until an associated paper is published. After a custom array design or experiment is loaded into ArrayExpress, the submitter is sent details of login accounts for themselves, and for journal editors and reviewers, so that they can view the data before it is publicly available. Data is made public when the submitter gives us permission to do so, or if we find that the data has been referenced in a published article.

Submitters and reviewers can log in to view their private experiments and array designs by clicking on the 'Submitter/reviewer login' link in the browse interface (www.ebi.ac.uk/arrayexpress). This will take you to a login box. Enter the login details we have provided you with.

Login box

If the 'Remember me' box is not checked, then you will remain logged in until the browser is closed.

If you have forgotten your ArrayExpress login and password, email the curation team at miamexpress@ebi.ac.uk to get your login details by email. Please specify the accession number of the experiment or array design you wish to view.

If you are a reviewer and have not been provided with an ArrayExpress login to access private data connected to a publication please contact the data submitter, via the journal, to request this information. We cannot provide access to private data to anyone without first getting authorization from the submitter or journal.

If you have submitted data using our MIAMExpress or MAGE-TAB submission tools please note that your submitter login account cannot be used to login to ArrayExpress. You will be sent a separate ArrayExpress reviewer login account when the processing of your submission is complete.


 

1.4 RSS feed

We provide an RSS service listing experiments as they become public in the ArrayExpress archive so that you can be aware of new experiments that may be of interest to you. The URL for the RSS feed is http://www.ebi.ac.uk/arrayexpress/rss/v2/experiments or you can click on the orange RSS icon on the ArrayExpress home page.
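
A feed like this can also be read by a script. The following is a minimal sketch that extracts item titles and links from feed text, assuming the feed follows the usual RSS 2.0 layout (channel, item, title, link); the actual structure of the ArrayExpress feed is not described here, and the sample feed in the code is invented for illustration.

```python
import xml.etree.ElementTree as ET

# Invented sample in the standard RSS 2.0 layout; not real ArrayExpress output.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example feed</title>
    <item>
      <title>Hypothetical experiment A</title>
      <link>http://example.org/a</link>
    </item>
    <item>
      <title>Hypothetical experiment B</title>
      <link>http://example.org/b</link>
    </item>
  </channel>
</rss>"""


def list_items(feed_xml: str) -> list:
    """Return (title, link) pairs for every item element in the feed."""
    root = ET.fromstring(feed_xml)
    return [
        (item.findtext("title", default=""), item.findtext("link", default=""))
        for item in root.iter("item")
    ]


for title, link in list_items(SAMPLE_FEED):
    print(title, "-", link)
```

To try this against the live feed, the text could be fetched first, for example with urllib.request.urlopen on the feed URL given above, and then passed to list_items.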


 

1.5 Downloading data and programmatic access for the archive and atlas

All data, experiment descriptions and array annotation in the ArrayExpress experiment archive can be downloaded from our FTP site in MAGE-TAB formats. See the following help pages:

 

1.5.1 Files available

Information about the files available for each experiment can be found here: ArrayExpress FTP downloads
 

1.5.2 MAGE-TAB format files

The MAGE-TAB format is described on this page - MAGE-TAB files
 

1.5.3 Programmatic access

Information on how to access the experiment archive and gene expression atlas programmatically is provided here - Programmatic Access


1.6 Understanding Archive search results

See the Search Results page for help about the information displayed about each experiment.


 

2. Searching the Atlas of Gene Expression

2.1. Atlas searches

See the Atlas-specific help page for information about searching the Atlas of Gene Expression.

 

 


If you have any further questions, please see our FAQ.
