Dataset updates for August 2025
The latest dataset releases are available for sequence searching in our Sequence Similarity Search bioinformatics applications and Dbfetch.
Overview of the current biological databases
The current dataset composition as 3rd September 2025 are as shown below. Dataset composition can be browsed in the JD Data Statistics page.
| Name | Seq. Type | No. Datasets | No. Entries | Last Updated |
|---|---|---|---|---|
| AFDB | protein | 1 | 214,684,312 | 28/07/2022 12:12:03 |
| CDP | nucleotide | 1 | 200,030 | 27/06/2022 16:42:58 |
| ChEMBL | protein | 1 | 14,763 | 01/08/2025 12:02:03 |
| EMVec | nucleotide | 1 | 7,794 | 27/08/2025 08:07:41 |
| ENA | nucleotide | 117 | 99,869,940 | 28/08/2025 07:31:16 |
| ENA coding | nucleotide | 89 | 1,138,224,193 | 01/09/2025 21:49:34 |
| ENA expanded contigs | nucleotide | 1 | 243,330 | 27/08/2025 07:57:52 |
| ENA non-coding | nucleotide | 64 | 49,259,216 | 28/08/2025 01:06:29 |
| ENA rRNA | nucleotide | 38 | 4,340,725 | 23/08/2023 00:37:29 |
| ENA spacer | nucleotide | 15 | 212,565 | 31/07/2025 15:12:16 |
| Ens | mixed | 20,631 | 709,027,820 | 27/08/2025 11:59:59 |
| EnsCovid | mixed | 4 | 26 | 20/02/2025 00:36:05 |
| EnsGenomes | mixed | 168,021 | 389,279,990 | 02/09/2025 09:37:21 |
| EPO | protein | 1 | 7,075,092 | 02/09/2025 00:42:14 |
| HMMER3 | protein | 7 | 207,034 | 16/06/2022 10:42:07 |
| IMGTHLAcds | nucleotide | 1 | 43,219 | 15/07/2025 02:00:26 |
| IMGTHLAgen | nucleotide | 1 | 24,355 | 15/07/2025 01:59:47 |
| IMGTHLApro | protein | 1 | 43,002 | 15/07/2025 02:00:26 |
| IMGTLIGM | nucleotide | 1 | 251,316 | 02/09/2025 00:25:10 |
| IntAct | protein | 1 | 126,008 | 04/04/2025 00:55:10 |
| InterPro | protein | 1 | 48,679 | 20/07/2025 18:10:16 |
| IPDKIRcds | nucleotide | 1 | 1,534 | 22/07/2024 11:21:13 |
| IPDKIRgen | nucleotide | 1 | 880 | 22/07/2024 11:22:11 |
| IPDKIRpro | protein | 1 | 1,387 | 22/07/2024 11:22:15 |
| IPDMHCcds | nucleotide | 1 | 11,506 | 22/07/2024 11:20:11 |
| IPDMHCgen | nucleotide | 1 | 3,008 | 22/07/2024 11:21:43 |
| IPDMHCpro | protein | 1 | 11,506 | 22/07/2024 11:22:42 |
| IPDNHKIRcds | nucleotide | 1 | 1,072 | 22/07/2024 11:22:11 |
| IPDNHKIRgen | nucleotide | 1 | 13 | 22/07/2024 11:22:11 |
| IPDNHKIRpro | protein | 1 | 1,072 | 22/07/2024 11:23:12 |
| IPRMC | protein | 1 | 253,702,583 | 21/07/2025 12:50:09 |
| IPRMC_UNIPARC | protein | 1 | 1 | 24/07/2025 08:51:17 |
| JPO | protein | 1 | 9,858,425 | 26/08/2025 01:14:02 |
| KIPO | protein | 1 | 2,800,446 | 26/08/2025 00:36:13 |
| MP | protein | 1 | 1,228,767 | 22/07/2024 11:18:39 |
| MPEP | protein | 1 | 1,228,278 | 22/07/2024 11:18:07 |
| MPRO | protein | 1 | 5,098 | 22/07/2024 11:17:40 |
| PANTHER | protein | 1 | 123,151 | 29/04/2025 11:03:25 |
| Patent Equivalents | protein | 1 | 119,710 | 07/05/2025 10:08:01 |
| PDB | protein | 1 | 897,410 | 29/08/2025 09:33:27 |
| PDBaa | protein | 1 | 897,410 | 28/08/2025 00:21:07 |
| PDBna | nucleotide | 1 | 56,753 | 28/08/2025 00:23:01 |
| Pfam | protein | 1 | 24,736 | 22/07/2025 00:24:51 |
| Rfam | nucleotide | 1 | 4,178 | 19/09/2024 01:03:21 |
| TAXONOMY | other | 1 | 1 | 02/09/2025 00:24:45 |
| TreeFam | protein | 1 | 15,736 | 10/11/2023 00:48:42 |
| UniParc | protein | 401 | 1,964,243,476 | 18/06/2025 18:07:00 |
| UniProtKB | protein | 3 | 14,697,390 | 18/06/2025 18:06:26 |
| UniProtKB Divisions | protein | 18 | 142,988,721 | 18/06/2025 18:06:52 |
| UniRef | protein | 3 | 26,000,000 | 18/06/2025 18:06:50 |
| UniVec | nucleotide | 1 | 6,111 | 24/08/2024 00:16:42 |
| USPTO | protein | 1 | 10,206,785 | 03/07/2025 01:26:30 |
| WormBase | mixed | 1,120 | 22,358,127 | 24/04/2024 15:23:51 |