Dataset updates for November 2023
The latest dataset releases are available for sequence searching in our Sequence Similarity Search bioinformatics applications and Dbfetch.
Key updates
During November 2023, we have released UniProKB version 2023_05.
Overview of the current biological databases
The current dataset composition as of 7th December 2023 are as shown below. Dataset composition can be browsed in the JD Data Statistics page.
| Name | Seq. Type | No. Datasets | No. Entries | Last Updated |
|---|---|---|---|---|
| AFDB | protein | 1 | 214,684,312 | 28/07/2022 12:12:03 |
| CDP | nucleotide | 1 | 200,030 | 27/06/2022 16:42:58 |
| ChEMBL | protein | 1 | 14,013 | 10/06/2023 00:19:35 |
| EMVec | nucleotide | 1 | 7,150 | 01/07/2023 04:24:31 |
| ENA | nucleotide | 97 | 66,631,298 | 25/07/2023 19:53:24 |
| ENA cds | nucleotide | 79 | 82,874,352 | 16/10/2023 06:11:07 |
| ENA expcon | nucleotide | 1 | 570,145 | 23/08/2023 01:47:28 |
| ENA ncr | nucleotide | 55 | 20,138,678 | 27/09/2023 00:23:58 |
| ENA rrna | nucleotide | 38 | 4,340,725 | 23/08/2023 00:37:29 |
| ENA spacer | nucleotide | 45 | 212,565 | 01/07/2023 02:49:57 |
| Ens | mixed | 964 | 77,361,235 | 04/12/2023 14:45:49 |
| EnsCovid | mixed | 4 | 26 | 29/08/2023 00:00:40 |
| EnsGenomes | mixed | 260,394 | 612,399,500 | 17/07/2023 16:20:02 |
| EPO | protein | 1 | 4,441,107 | 23/05/2023 17:05:48 |
| HMMER3 | protein | 7 | 207,034 | 16/06/2022 10:42:07 |
| IMGTHLAcds | nucleotide | 1 | 37,611 | 17/10/2023 00:07:13 |
| IMGTHLAgen | nucleotide | 1 | 20,027 | 17/10/2023 00:07:43 |
| IMGTHLApro | protein | 1 | 37,423 | 17/10/2023 00:06:41 |
| IMGTLIGM | nucleotide | 1 | 246,842 | 25/06/2023 00:11:52 |
| IntAct | protein | 1 | 122,971 | 10/10/2023 00:37:05 |
| InterPro | protein | 1 | 40,562 | 10/11/2023 00:46:46 |
| IPDKIRcds | nucleotide | 1 | 1,534 | 16/06/2022 10:05:51 |
| IPDKIRgen | nucleotide | 1 | 880 | 23/08/2023 00:20:39 |
| IPDKIRpro | protein | 1 | 1,387 | 04/08/2022 14:46:39 |
| IPDMHCcds | nucleotide | 1 | 11,519 | 23/08/2023 00:20:29 |
| IPDMHCgen | nucleotide | 1 | 3,008 | 23/08/2023 00:20:29 |
| IPDMHCpro | protein | 1 | 11,519 | 16/06/2022 10:00:12 |
| IPDNHKIRcds | nucleotide | 1 | 1,072 | 16/01/2023 23:29:48 |
| IPDNHKIRgen | nucleotide | 1 | 13 | 16/01/2023 23:22:40 |
| IPDNHKIRpro | protein | 1 | 1,072 | 16/01/2023 23:22:40 |
| IPRMC | protein | 1 | 251,768,942 | 10/11/2023 13:01:45 |
| IPRMC_UNIPARC | protein | 1 | 1 | 13/11/2023 21:12:31 |
| JPO | protein | 1 | 5,785,210 | 26/07/2023 02:12:59 |
| KIPO | protein | 1 | 1,766,417 | 25/07/2023 02:13:49 |
| MP | protein | 1 | 1,228,767 | 16/06/2022 10:06:17 |
| MPEP | protein | 1 | 1,228,278 | 16/06/2022 10:00:29 |
| MPRO | protein | 1 | 5,098 | 07/01/2023 23:47:07 |
| PANTHER | protein | 1 | 123,151 | 10/11/2023 01:10:31 |
| Patent Equivalents | protein | 1 | 119,710 | 03/04/2023 14:31:12 |
| PDB | protein | 1 | 745,855 | 09/11/2023 00:07:17 |
| PDBaa | protein | 1 | 745,855 | 09/11/2023 00:08:16 |
| PDBna | nucleotide | 1 | 47,436 | 09/11/2023 00:07:14 |
| Pfam | protein | 1 | 20,795 | 15/09/2023 01:30:47 |
| Rfam | nucleotide | 1 | 4,170 | 15/11/2023 12:48:45 |
| TAXONOMY | other | 1 | 1 | 27/07/2023 00:39:04 |
| TreeFam | protein | 1 | 15,736 | 10/11/2023 00:48:42 |
| UniParc | protein | 401 | 1,264,336,020 | 10/11/2023 09:57:30 |
| UniProtKB | protein | 3 | 16,693,510 | 10/11/2023 09:56:47 |
| UniProtKB Divisions | protein | 18 | 157,157,380 | 10/11/2023 09:57:32 |
| UniRef | protein | 3 | 40,000,000 | 10/11/2023 09:57:22 |
| UniVec | nucleotide | 1 | 6,113 | 20/07/2022 16:25:46 |
| USPTO | protein | 1 | 6,674,765 | 17/05/2023 01:17:42 |
| WormBase | mixed | 959 | 19,488,094 | 13/04/2023 12:45:13 |