The current and previous GOA files can be downloaded from the GOA ftp site The UniProt multi-species file and individual species files are available at the top level and intuitively named. If you require data for further species not listed at the top level, they can be found in the 'proteomes' folder. The type of files we provide are as follows;
File descriptions for Annotation data can be found here
N.B. As the human gene association file from GOA is no longer constructed using the International Protein Index (IPI) resource, users are now invited make use of the UniProtKB identifier mapping file . The format for this file is available in its Readme .
GO mapping files; The files listed contain the transitive assignments of InterPro matches, UniProtKB keywords, UniProtKB subcellular locations, Enzyme Commission numbers or HAMAP matches to manually-selected GO terms. The files are used in conjunction with annotated databases, such as UniProtKB to generate automatic (IEA-evidenced) GO annotations.
For more information on these files read the GOA Readme file .
Gene product association and information files; The GOA group now provides two new files that have been generated from gene_association.goa_uniprot. These files separately contain the
information specifically required to describe a GO annotation (gp_association.goa_uniprot) and to describe the proteins for which annotations are provided (gp_information.goa_uniprot). Use of these two files instead of the gene_association.goa_uniprot file has the advantage of reduced redundancy in the information supplied, resulting in a combined size of 126MB less than the gene_association.goa_uniprot file. However the format of these two files is subject to ongoing discussions by the GO Consortium, so their exact format may change over time.