Dataset updates for January 2024
The latest dataset releases are available for sequence searching in our Sequence Similarity Search bioinformatics applications and Dbfetch.
Key updates
During January 2024, we have released UniProKB version 2024_01. We have also released Ensembl Genomes version 58.
Overview of the current biological databases
The current dataset composition as of 31st January 2024 are as shown below. Dataset composition can be browsed in the JD Data Statistics page.
| Name | Seq. Type | No. Datasets | No. Entries | Last Updated |
|---|---|---|---|---|
| AFDB | protein | 1 | 214,684,312 | 28/07/2022 12:12:03 |
| CDP | nucleotide | 1 | 200,030 | 27/06/2022 16:42:58 |
| ChEMBL | protein | 1 | 14,013 | 10/06/2023 00:19:35 |
| EMVec | nucleotide | 1 | 7,150 | 01/07/2023 04:24:31 |
| ENA | nucleotide | 97 | 66,631,298 | 25/07/2023 19:53:24 |
| ENA cds | nucleotide | 82 | 264,491,921 | 23/12/2023 12:25:46 |
| ENA expcon | nucleotide | 1 | 570,145 | 23/08/2023 01:47:28 |
| ENA ncr | nucleotide | 57 | 23,053,333 | 28/01/2024 01:09:44 |
| ENA rrna | nucleotide | 38 | 4,340,725 | 23/08/2023 00:37:29 |
| ENA spacer | nucleotide | 45 | 212,565 | 01/07/2023 02:49:57 |
| Ens | mixed | 964 | 77,361,235 | 04/12/2023 14:45:49 |
| EnsCovid | mixed | 4 | 26 | 31/12/2023 00:24:02 |
| EnsGenomes | mixed | 166,714 | 361,700,632 | 17/07/2023 16:20:02 |
| EPO | protein | 1 | 4,441,107 | 23/05/2023 17:05:48 |
| HMMER3 | protein | 7 | 207,034 | 16/06/2022 10:42:07 |
| IMGTHLAcds | nucleotide | 1 | 37,611 | 17/10/2023 00:07:13 |
| IMGTHLAgen | nucleotide | 1 | 21,139 | 17/01/2024 00:27:15 |
| IMGTHLApro | protein | 1 | 37,423 | 17/10/2023 00:06:41 |
| IMGTLIGM | nucleotide | 1 | 246,842 | 25/06/2023 00:11:52 |
| IntAct | protein | 1 | 122,971 | 10/10/2023 00:37:05 |
| InterPro | protein | 1 | 40,768 | 26/01/2024 01:17:31 |
| IPDKIRcds | nucleotide | 1 | 1,534 | 16/12/2023 00:22:12 |
| IPDKIRgen | nucleotide | 1 | 880 | 16/12/2023 00:20:54 |
| IPDKIRpro | protein | 1 | 1,387 | 16/12/2023 00:21:05 |
| IPDMHCcds | nucleotide | 1 | 11,519 | 23/08/2023 00:20:29 |
| IPDMHCgen | nucleotide | 1 | 3,008 | 23/08/2023 00:20:29 |
| IPDMHCpro | protein | 1 | 11,519 | 16/06/2022 10:00:12 |
| IPDNHKIRcds | nucleotide | 1 | 1,072 | 16/01/2023 23:29:48 |
| IPDNHKIRgen | nucleotide | 1 | 13 | 16/01/2023 23:22:40 |
| IPDNHKIRpro | protein | 1 | 1,072 | 16/01/2023 23:22:40 |
| IPRMC | protein | 1 | 250,389,714 | 26/01/2024 15:15:42 |
| IPRMC_UNIPARC | protein | 1 | NA | 29/01/2024 17:31:22 |
| JPO | protein | 1 | 5,791,620 | 27/01/2024 01:18:56 |
| KIPO | protein | 1 | 1,766,417 | 26/01/2024 01:37:04 |
| MP | protein | 1 | 1,228,767 | 16/06/2022 10:06:17 |
| MPEP | protein | 1 | 1,228,278 | 16/06/2022 10:00:29 |
| MPRO | protein | 1 | 5,098 | 07/01/2023 23:47:07 |
| PANTHER | protein | 1 | 123,151 | 10/11/2023 01:10:31 |
| Patent Equivalents | protein | 1 | 119,710 | 03/04/2023 14:31:12 |
| PDB | protein | 1 | 757,162 | 31/01/2024 00:45:35 |
| PDBaa | protein | 1 | 757,162 | 31/01/2024 00:43:08 |
| PDBna | nucleotide | 1 | 48,174 | 31/01/2024 00:43:37 |
| Pfam | protein | 1 | 20,795 | 15/09/2023 01:30:47 |
| Rfam | nucleotide | 1 | 4,170 | 15/11/2023 12:48:45 |
| TAXONOMY | other | 1 | 1 | 30/01/2024 00:24:12 |
| TreeFam | protein | 1 | 15,736 | 10/11/2023 00:48:42 |
| UniParc | protein | 401 | 1,329,362,100 | 30/01/2024 13:57:44 |
| UniProtKB | protein | 3 | 12,694,145 | 30/01/2024 13:58:05 |
| UniProtKB Divisions | protein | 18 | 153,919,484 | 30/01/2024 13:58:42 |
| UniRef | protein | 3 | 90,514,904 | 30/01/2024 13:57:27 |
| UniVec | nucleotide | 1 | 6,113 | 26/01/2024 14:52:53 |
| USPTO | protein | 1 | 9,297,014 | 14/12/2023 01:29:21 |
| WormBase | mixed | 1,120 | 22,358,127 | 13/04/2023 12:45:13 |