The current and previous UniProt-GOA files can be downloaded from the UniProt-GOA ftp site The UniProt multi-species file and individual species files are available at the top level and intuitively named. If you require data for further species not listed at the top level, they can be found in the 'proteomes' folder. The type of files we provide are as follows;
Gene Association Files; Tab-delimited files of associations between gene products, GO terms and associated annotation information. The UniProt-GOA project uses GAF2.0, which is the standard annotation format used within the GO Consortium.
For more information on the format and contents of these files please read the UniProt-GOA Readme.
N.B. As the human gene association file from UniProt-GOA is no longer constructed using the International Protein Index (IPI) resource, users are now invited make use of the UniProtKB identifier mapping file . The format for this file is available in its Readme .
GO mapping files; The files listed contain the transitive assignments of InterPro matches, UniProtKB keywords, UniProtKB subcellular locations, Enzyme Commission numbers or HAMAP matches to manually-selected GO terms. The files are used in conjunction with annotated databases, such as UniProtKB to generate automatic (IEA-evidenced) GO annotations.
For more information on these files read the UniProt-GOA Readme file .
Gene product association and information files; The UniProt-GOA group now provides two new files that have been generated from gene_association.goa_uniprot. These files separately contain the
information specifically required to describe a GO annotation (gp_association.goa_uniprot) and to describe the proteins for which annotations are provided (gp_information.goa_uniprot). Use of these two files instead of the gene_association.goa_uniprot file has the advantage of reduced redundancy in the information supplied, resulting in a combined size of 126MB less than the gene_association.goa_uniprot file. However the format of these two files is subject to ongoing discussions by the GO Consortium, so their exact format may change over time.