0%

The UniProt databases

There are three UniProt databases:

  • The UniProt Knowledgebase (UniProtKB)
  • The UniProt Reference Clusters (UniRef)
  • The UniProt Archive (UniParc)

In this tutorial we will only focus on UniProtKB. For more information on UniRef or UniParc, visit the UniProt Quick Tour.

UniProtKB

UniProtKB is the central hub for the collection of functional information on proteins, with accurate, consistent and rich annotation. It consists of two sections:

  • Reviewed (UniProtKB/Swiss-Prot) – contains manually annotated records with data added by expert biocurators giving information on protein function, structure, subcellular location, molecular interactions and much more. Each entry in UniProtKB/Swiss-Prot represents a single, non redundant gene from a specific organism and all proteins and peptides transcribed by that gene are described within the record.
  • Unreviewed (UniProtKB/TrEMBL) – contains computationally analysed records with additional information transferred from related well-annotated records in UniProtKB/Swiss-Prot by automatic annotation processes. All identical, full-length protein sequences from the same species are represented in a single record while fragments, isoforms and variants encoded by the same gene are stored in separate entries.
Planned UniProtKB change
In 2026, UniProtKB will be restricted to containing only those sequences which are part of a reference proteome as well as entries in the reviewed UniProtKB/Swiss-Prot section and entries containing additional biologically important data such as a 3D structure. More information about this change is available in the UniProt blog.