Downloads

The current and previous UniProt-GOA files can be downloaded from the  UniProt-GOA ftp site The UniProt multi-species file and individual species files are available at the top level and intuitively named. If you require data for further species not listed at the top level, they can be found in the 'proteomes' folder. The type of files we provide are as follows;

Gene Association Files; Tab-delimited files of associations between gene products, GO terms and associated annotation information. The UniProt-GOA project uses GAF2.0, which is the standard annotation format used within the GO Consortium.

For more information on the format and contents of these files please read the UniProt-GOA Readme.

N.B. As the human gene association file from UniProt-GOA is no longer constructed using the International Protein Index (IPI) resource, users are now invited make use of the UniProtKB identifier mapping file . The format for this file is available in its Readme .

GO mapping files; The files listed contain the transitive assignments of InterPro matches, UniProtKB keywords, UniProtKB subcellular locations, Enzyme Commission numbers or HAMAP matches to manually-selected GO terms. The files are used in conjunction with annotated databases, such as UniProtKB to generate automatic (IEA-evidenced) GO annotations.

For more information on these files read the UniProt-GOA Readme file .

Gene product association and information files; The UniProt-GOA group now provides two new files that have been generated from gene_association.goa_uniprot. These files separately contain the 
information specifically required to describe a GO annotation (gp_association.goa_uniprot) and to describe the proteins for which annotations are provided (gp_information.goa_uniprot). Use of these two files instead of the gene_association.goa_uniprot file has the advantage of reduced redundancy in the information supplied, resulting in a combined size of 126MB less than the gene_association.goa_uniprot file. However the format of these two files is subject to ongoing discussions by the GO Consortium, so their exact format may change over time.

Set of Proteins Gene Association File Identifier Cross-Reference File
UniProtKB Gene association UniProtKB ID mapping | Readme
Human Gene association UniProtKB ID mapping | Readme
Mouse Gene association UniProtKB ID mapping | Readme
Rat Gene association UniProtKB ID mapping | Readme
Arabidopsis Gene association UniProtKB ID mapping | Readme
Zebrafish Gene association UniProtKB ID mapping | Readme
Chicken Gene association UniProtKB ID mapping | Readme
Cow Gene association UniProtKB ID mapping | Readme
Dicty Gene association UniProtKB ID mapping | Readme
Dog Gene association UniProtKB ID mapping | Readme
Pig Gene association UniProtKB ID mapping | Readme
Fly Gene association UniProtKB ID mapping | Readme
Worm Gene association UniProtKB ID mapping | Readme
Yeast Gene association UniProtKB ID mapping | Readme
Proteomes Gene association Browse n/a
PDB Gene association n/a
Mapping Files
InterPro2GO
UniProtKB-KW2GO
UniProtKB-SL2GO
EC2GO
HAMAP2GO
UniPathway2GO
Other
Gene product association file | Readme
Gene product information file | Readme



spacer