Dataset updates for November 2024
The latest dataset releases are available for sequence searching in our Sequence Similarity Search bioinformatics applications and Dbfetch.
Key updates
During November 2024, we have released UniProKB version 2024_06. We have also released InterProScan version 5.72-103.0.
Overview of the current biological databases
The current dataset composition as of 3rd December 2024 are as shown below. Dataset composition can be browsed in the JD Data Statistics page.
| Name | Seq. Type | No. Datasets | No. Entries | Last Updated |
|---|---|---|---|---|
| AFDB | protein | 1 | 214,684,312 | 28/07/2022 12:12:03 |
| CDP | nucleotide | 1 | 200,030 | 27/06/2022 16:42:58 |
| ChEMBL | protein | 1 | 14,321 | 05/09/2024 14:34:46 |
| EMVec | nucleotide | 1 | 7,561 | 17/07/2024 04:59:53 |
| ENA | nucleotide | 115 | 78,584,200 | 18/07/2024 19:50:37 |
| ENA cds | nucleotide | 88 | 394,866,750 | 19/07/2024 17:34:29 |
| ENA expcon | nucleotide | 1 | 43,043 | 17/07/2024 04:49:43 |
| ENA ncr | nucleotide | 64 | 49,521,585 | 28/11/2024 01:08:56 |
| ENA rrna | nucleotide | 38 | 4,340,725 | 23/08/2023 00:37:29 |
| Ens | mixed | 13,796 | 383,195,110 | 25/10/2024 08:31:58 |
| EnsCovid | mixed | 4 | 26 | 31/12/2023 00:24:02 |
| EnsGenomes | mixed | 167,321 | 367,524,186 | 17/10/2024 10:51:38 |
| EPO | protein | 1 | 6,030,246 | 03/12/2024 00:54:20 |
| HMMER3 | protein | 7 | 207,034 | 16/06/2022 10:42:07 |
| IMGTHLAcds | nucleotide | 1 | 41,428 | 11/10/2024 00:25:10 |
| IMGTHLAgen | nucleotide | 1 | 23,373 | 11/10/2024 00:26:29 |
| IMGTHLApro | protein | 1 | 41,215 | 11/10/2024 00:24:50 |
| IMGTLIGM | nucleotide | 1 | 246,842 | 22/07/2024 11:23:42 |
| IntAct | protein | 1 | 124,556 | 19/09/2024 00:38:24 |
| InterPro | protein | 1 | 47,049 | 29/11/2024 01:10:54 |
| IPDKIRcds | nucleotide | 1 | 1,534 | 22/07/2024 11:21:13 |
| IPDKIRgen | nucleotide | 1 | 880 | 22/07/2024 11:22:11 |
| IPDKIRpro | protein | 1 | 1,387 | 22/07/2024 11:22:15 |
| IPDMHCcds | nucleotide | 1 | 11,506 | 22/07/2024 11:20:11 |
| IPDMHCgen | nucleotide | 1 | 3,008 | 22/07/2024 11:21:43 |
| IPDMHCpro | protein | 1 | 11,506 | 22/07/2024 11:22:42 |
| IPDNHKIRcds | nucleotide | 1 | 1,072 | 22/07/2024 11:22:11 |
| IPDNHKIRgen | nucleotide | 1 | 13 | 22/07/2024 11:22:11 |
| IPDNHKIRpro | protein | 1 | 1,072 | 22/07/2024 11:23:12 |
| IPRMC | protein | 1 | 254,322,157 | 29/11/2024 12:22:35 |
| IPRMC_UNIPARC | protein | 1 | 1 | 10/10/2024 09:49:39 |
| JPO | protein | 1 | 6,565,137 | 29/11/2024 01:46:51 |
| KIPO | protein | 1 | 2,492,774 | 29/11/2024 01:32:06 |
| MP | protein | 1 | 1,228,767 | 22/07/2024 11:18:39 |
| MPEP | protein | 1 | 1,228,278 | 22/07/2024 11:18:07 |
| MPRO | protein | 1 | 5,098 | 22/07/2024 11:17:40 |
| PANTHER | protein | 1 | 123,151 | 10/11/2023 01:10:31 |
| Patent Equivalents | protein | 1 | 119,710 | 03/04/2023 14:31:12 |
| PDB | protein | 1 | 826,980 | 28/11/2024 00:23:31 |
| PDBaa | protein | 1 | 826,980 | 28/11/2024 00:22:51 |
| PDBna | nucleotide | 1 | 52,472 | 28/11/2024 00:22:03 |
| Pfam | protein | 1 | 23,794 | 29/11/2024 01:11:48 |
| Rfam | nucleotide | 1 | 4,178 | 19/09/2024 01:03:21 |
| TAXONOMY | other | 1 | 1 | 03/12/2024 00:34:10 |
| TreeFam | protein | 1 | 15,736 | 10/11/2023 00:48:42 |
| UniParc | protein | 401 | 1,742,938,134 | 28/11/2024 10:19:13 |
| UniProtKB | protein | 3 | 14,696,273 | 28/11/2024 10:18:43 |
| UniProtKB Divisions | protein | 18 | 134,632,325 | 28/11/2024 10:19:04 |
| UniRef | protein | 3 | 36,000,000 | 28/11/2024 10:19:03 |
| UniVec | nucleotide | 1 | 6,111 | 24/08/2024 00:16:42 |
| USPTO | protein | 1 | 9,733,759 | 12/07/2024 01:39:53 |
| WormBase | mixed | 1,120 | 22,358,127 | 24/04/2024 15:23:51 |