Dataset updates for May 2025
The latest dataset releases are available for sequence searching in our Sequence Similarity Search bioinformatics applications and Dbfetch.
Overview of the current biological databases
The current dataset composition as 9th June 2025 are as shown below. Dataset composition can be browsed in the JD Data Statistics page.
| Name | Seq. Type | No. Datasets | No. Entries | Last Updated |
|---|---|---|---|---|
| AFDB | protein | 1 | 214,684,312 | 28/07/2022 12:12:03 |
| CDP | nucleotide | 1 | 200,030 | 27/06/2022 16:42:58 |
| ChEMBL | protein | 1 | 14,763 | 20/02/2025 00:36:05 |
| EMVec | nucleotide | 1 | 7,734 | 17/05/2025 10:43:56 |
| ENA | nucleotide | 117 | 91,602,121 | 19/05/2025 00:46:42 |
| ENA cds | nucleotide | 89 | 1,120,800,530 | 05/06/2025 22:15:36 |
| ENA expcon | nucleotide | 1 | 43,041 | 17/05/2025 10:33:03 |
| ENA ncr | nucleotide | 64 | 50,516,783 | 28/05/2025 01:14:09 |
| ENA rrna | nucleotide | 38 | 4,340,725 | 23/08/2023 00:37:29 |
| Ens | mixed | 17,649 | 604,477,008 | 22/05/2025 10:47:33 |
| EnsCovid | mixed | 4 | 26 | 20/02/2025 00:36:05 |
| EnsGenomes | mixed | 167,888 | 377,843,991 | 07/05/2025 09:14:16 |
| EPO | protein | 1 | 6,655,089 | 03/06/2025 00:41:48 |
| HMMER3 | protein | 7 | 207,034 | 16/06/2022 10:42:07 |
| IMGTHLAcds | nucleotide | 1 | 42,579 | 10/04/2025 00:51:54 |
| IMGTHLAgen | nucleotide | 1 | 24,355 | 10/04/2025 00:54:35 |
| IMGTHLApro | protein | 1 | 42,366 | 10/04/2025 00:55:08 |
| IMGTLIGM | nucleotide | 1 | 246,842 | 22/07/2024 11:23:42 |
| IntAct | protein | 1 | 126,008 | 04/04/2025 00:55:10 |
| InterPro | protein | 1 | 48,003 | 12/05/2025 11:55:04 |
| IPDKIRcds | nucleotide | 1 | 1,534 | 22/07/2024 11:21:13 |
| IPDKIRgen | nucleotide | 1 | 880 | 22/07/2024 11:22:11 |
| IPDKIRpro | protein | 1 | 1,387 | 22/07/2024 11:22:15 |
| IPDMHCcds | nucleotide | 1 | 11,506 | 22/07/2024 11:20:11 |
| IPDMHCgen | nucleotide | 1 | 3,008 | 22/07/2024 11:21:43 |
| IPDMHCpro | protein | 1 | 11,506 | 22/07/2024 11:22:42 |
| IPDNHKIRcds | nucleotide | 1 | 1,072 | 22/07/2024 11:22:11 |
| IPDNHKIRgen | nucleotide | 1 | 13 | 22/07/2024 11:22:11 |
| IPDNHKIRpro | protein | 1 | 1,072 | 22/07/2024 11:23:12 |
| IPRMC | protein | 1 | 252,828,976 | 12/05/2025 11:55:12 |
| IPRMC_UNIPARC | protein | 1 | 1 | 26/05/2025 12:58:14 |
| JPO | protein | 1 | 9,858,425 | 03/06/2025 01:30:02 |
| KIPO | protein | 1 | 2,678,427 | 08/04/2025 01:15:54 |
| MP | protein | 1 | 1,228,767 | 22/07/2024 11:18:39 |
| MPEP | protein | 1 | 1,228,278 | 22/07/2024 11:18:07 |
| MPRO | protein | 1 | 5,098 | 22/07/2024 11:17:40 |
| PANTHER | protein | 1 | 123,151 | 29/04/2025 11:03:25 |
| Patent Equivalents | protein | 1 | 119,710 | 07/05/2025 10:08:01 |
| PDB | protein | 1 | 875,551 | 05/06/2025 02:04:05 |
| PDBaa | protein | 1 | 875,551 | 05/06/2025 02:05:48 |
| PDBna | nucleotide | 1 | 55,145 | 05/06/2025 02:04:05 |
| Pfam | protein | 1 | 24,424 | 25/04/2025 03:28:52 |
| Rfam | nucleotide | 1 | 4,178 | 19/09/2024 01:03:21 |
| TAXONOMY | other | 1 | 1 | 07/06/2025 00:37:32 |
| TreeFam | protein | 1 | 15,736 | 10/11/2023 00:48:42 |
| UniParc | protein | 1 | 330,738 | 24/04/2025 12:11:59 |
| UniProtKB | protein | 3 | 12,696,974 | 24/04/2025 12:11:28 |
| UniProtKB Divisions | protein | 18 | 100,358,922 | 24/04/2025 12:11:44 |
| UniRef | protein | 3 | 30,000,000 | 24/04/2025 12:11:46 |
| UniVec | nucleotide | 1 | 6,111 | 24/08/2024 00:16:42 |
| USPTO | protein | 1 | 10,032,088 | 26/01/2025 01:25:07 |
| WormBase | mixed | 1,120 | 22,358,127 | 24/04/2024 15:23:51 |