![]() |
Data File Formats for ArrayExpress SubmissionIn this help page:
OverviewThis diagram shows how the raw data files, normalized data files and final gene expression matrix are related in an ArrayExpress experiment submission. Normalized data can be submitted as one file per hyb, or in a final gene expression matrix.
Supported file types by submission tool
Raw data is required for all submissions. The normalized files and combined data file (final gene expression matrix) are optional but you should provide at least one of these file types for your submission to be fully MIAME compliant.
Raw data filesDO NOT EDIT YOUR RAW DATA FILES. For Affymetrix the raw data is the CEL file. For other platforms the raw data is the file which contains the signal intensities, background intensities, etc, for every spot on the array, e.g. GenePix .gpr file, Agilent Feature Extraction software .txt file. Our submission tools support raw data from the software listed below:
If your data is not from one of these programs then we will probably be able to convert it to a format that is supported. Please note that we are only able to handle files that can be read by a standard text editor (with the exception of binary CEL files). If you are not sure what files to provide you can email us at either attaching an example file to the email or FTP the example file to us.
Normalized data filesApplying a normalization algorithm to a raw data file, for example print tip normalization, produces a normalized file. Submitted normalized data files must contain data from a single hybridization only. If your normalization procedure creates a file containing data from all your hybs then you can submit this as a final gene expression matrix (FGEM). A normalization protocol should be submitted along with your normalized data files. Please make sure your normalization protocol contains enough information to allow users to understand what the data in your normalized files means. Be precise when describing how the data was calculated, e.g. 'log ratio' is not enough information for MIAME compliance, we need to know what kind of log it is (log2, log10, loge etc). Affymetrix per-hyb normalized dataFor Affymetrix submissions you can submit the CHP file, or a text file from some other software as per-hyb normalized data. Each line in the text files must correspond to an Affymetrix probe set, the probe set ID (called a CompositeSequence Identifier by us) must be provided in the first column of the file, e.g. -Example of a single hyb GC-RMA normalized data file Other per-hyb normalized dataThe per-hyb normalized data can contain either:
-Example of a lowess normalized data file with MetaColumn, MetaRow, Column, Row coordinates -Example of a median normalized data file with Reporter Identifiers
Final gene expression matrix (FGEM)A final gene expression matrix (FGEM) or combined data file is a file containing data from several hybridizations. It can be created by any data processing or spreadsheet software but must be saved as a tab delimited .txt file. MIAMExpress allows you to upload only 1 FGEM per experiment. Tab2MAGE and MAGE-TAB allow you to upload multiple FGEMs per experiment. The creation of your FGEM must be described in your transformation protocol. Be precise when describing how the data was calculated, e.g. 'log ratio' is not enough information for MIAME compliance, we need to know what kind of log it is (log2, log10, loge etc). The format of the FGEM is as follows:
Explanation of columns in an FGEM:
Examples for download (files truncated for faster download):
Sending files by FTPThe email account cannot receive large attachments so if you need to send several files to us prior to submission (e.g. for us to check they are in a suitable format) then you can put them on our FTP site. After putting them on the FTP site email and tell us the name of the file transferred. Data files placed on the FTP site are NOT submitted to ArrayExpress. To submit files to ArrayExpress you must upload them using either the MIAMExpress or Tab2MAGE submission tools. To transfer files using Windows, open Windows Explorer and enter ftp://aexpress@ftp1.ebi.ac.uk/. You will be asked to login. The login and password are aexpress. After logging in you can drag and drop your files across. Note: you will not be able to see any files or directories already on the FTP site. To transfer files using Unix, a Mac terminal window or the windows command prompt connect to the FTP server using the command ftp ftp1.ebi.ac.uk. Username and password are: aexpress. Use the put command to place your file (or mput for multiple files) into the default directory. Please ensure that you use unique file names. To exit FTP, type quit. On exiting you will get a message printed to screen to tell you whether your transfer was successful. Note: you will not be able to list the files in the directory or download files from the FTP site to your directory. . Any further questions, please see our FAQ. |
||||||||||||||||||||||||||||||||||