Dataset updates for June 2025
The latest dataset releases are available for sequence searching in our Sequence Similarity Search bioinformatics applications and Dbfetch.
Key updates
During June 2025, we have released UniProKB version 2025_03 and InterProScan version 5.75-106.0.
Overview of the current biological databases
The current dataset composition as 30th June 2025 are as shown below. Dataset composition can be browsed in the JD Data Statistics page.
| Name | Seq. Type | No. Datasets | No. Entries | Last Updated |
|---|---|---|---|---|
| Name | Seq. Type | No. Datasets | No. Entries | Last Updated |
| AFDB | protein | 1 | 214,684,312 | 28/07/2022 12:12:03 |
| CDP | nucleotide | 1 | 200,030 | 27/06/2022 16:42:58 |
| ChEMBL | protein | 1 | 14,763 | 20/02/2025 00:36:05 |
| EMVec | nucleotide | 1 | 7,736 | 21/06/2025 08:42:14 |
| ENA | nucleotide | 117 | 91,016,202 | 22/06/2025 03:02:07 |
| ENA cds | nucleotide | 89 | 1,120,798,588 | 15/06/2025 17:34:16 |
| ENA expcon | nucleotide | 1 | 43,043 | 21/06/2025 08:53:50 |
| ENA ncr | nucleotide | 64 | 49,447,375 | 27/06/2025 01:20:51 |
| ENA rrna | nucleotide | 38 | 4,340,725 | 23/08/2023 00:37:29 |
| Ens | mixed | 17,971 | 622,397,726 | 22/05/2025 10:47:33 |
| EnsCovid | mixed | 4 | 26 | 20/02/2025 00:36:05 |
| EnsGenomes | mixed | 167,888 | 377,843,991 | 07/05/2025 09:14:16 |
| EPO | protein | 1 | 6,754,787 | 24/06/2025 00:43:29 |
| HMMER3 | protein | 7 | 207,034 | 16/06/2022 10:42:07 |
| IMGTHLAcds | nucleotide | 1 | 42,579 | 10/04/2025 00:51:54 |
| IMGTHLAgen | nucleotide | 1 | 24,355 | 10/04/2025 00:54:35 |
| IMGTHLApro | protein | 1 | 42,366 | 10/04/2025 00:55:08 |
| IMGTLIGM | nucleotide | 1 | 246,842 | 22/07/2024 11:23:42 |
| IntAct | protein | 1 | 126,008 | 04/04/2025 00:55:10 |
| InterPro | protein | 1 | 48,679 | 20/06/2025 01:48:10 |
| IPDKIRcds | nucleotide | 1 | 1,534 | 22/07/2024 11:21:13 |
| IPDKIRgen | nucleotide | 1 | 880 | 22/07/2024 11:22:11 |
| IPDKIRpro | protein | 1 | 1,387 | 22/07/2024 11:22:15 |
| IPDMHCcds | nucleotide | 1 | 11,506 | 22/07/2024 11:20:11 |
| IPDMHCgen | nucleotide | 1 | 3,008 | 22/07/2024 11:21:43 |
| IPDMHCpro | protein | 1 | 11,506 | 22/07/2024 11:22:42 |
| IPDNHKIRcds | nucleotide | 1 | 1,072 | 22/07/2024 11:22:11 |
| IPDNHKIRgen | nucleotide | 1 | 13 | 22/07/2024 11:22:11 |
| IPDNHKIRpro | protein | 1 | 1,072 | 22/07/2024 11:23:12 |
| IPRMC | protein | 1 | 253,702,583 | 20/06/2025 17:47:32 |
| IPRMC_UNIPARC | protein | 1 | 1 | 26/05/2025 12:58:14 |
| JPO | protein | 1 | 9,858,425 | 03/06/2025 01:30:02 |
| KIPO | protein | 1 | 2,678,427 | 10/06/2025 06:34:19 |
| MP | protein | 1 | 1,228,767 | 22/07/2024 11:18:39 |
| MPEP | protein | 1 | 1,228,278 | 22/07/2024 11:18:07 |
| MPRO | protein | 1 | 5,098 | 22/07/2024 11:17:40 |
| PANTHER | protein | 1 | 123,151 | 29/04/2025 11:03:25 |
| Patent Equivalents | protein | 1 | 119,710 | 07/05/2025 10:08:01 |
| PDB | protein | 1 | 881,146 | 26/06/2025 00:30:18 |
| PDBaa | protein | 1 | 881,146 | 26/06/2025 00:30:20 |
| PDBna | nucleotide | 1 | 55,598 | 26/06/2025 00:27:55 |
| Pfam | protein | 1 | 24,736 | 21/06/2025 08:25:51 |
| Rfam | nucleotide | 1 | 4,178 | 19/09/2024 01:03:21 |
| TAXONOMY | other | 1 | 1 | 28/06/2025 00:34:49 |
| TreeFam | protein | 1 | 15,736 | 10/11/2023 00:48:42 |
| UniParc | protein | 401 | 1,964,243,476 | 18/06/2025 18:07:00 |
| UniProtKB | protein | 3 | 14,697,390 | 18/06/2025 18:06:26 |
| UniProtKB Divisions | protein | 18 | 142,988,721 | 18/06/2025 18:06:52 |
| UniRef | protein | 3 | 26,000,000 | 18/06/2025 18:06:50 |
| UniVec | nucleotide | 1 | 6,111 | 24/08/2024 00:16:42 |
| USPTO | protein | 1 | 10,032,088 | 26/01/2025 01:25:07 |
| WormBase | mixed | 1,120 | 22,358,127 | 24/04/2024 15:23:51 |