Dataset updates for March 2025
The latest dataset releases are available for sequence searching in our Sequence Similarity Search bioinformatics applications and Dbfetch.
Overview of the current biological databases
The current dataset composition as of 3rd April 2025 are as shown below. Dataset composition can be browsed in the JD Data Statistics page.
| Name | Seq. Type | No. Datasets | No. Entries | Last Updated |
|---|---|---|---|---|
| AFDB | protein | 1 | 214,684,312 | 28/07/2022 12:12:03 |
| CDP | nucleotide | 1 | 200,030 | 27/06/2022 16:42:58 |
| ChEMBL | protein | 1 | 14,763 | 20/02/2025 00:36:05 |
| EMVec | nucleotide | 1 | 7,704 | 19/03/2025 01:50:49 |
| ENA | nucleotide | 117 | 93,432,145 | 21/03/2025 04:05:28 |
| ENA cds | nucleotide | 88 | 380,705,115 | 20/03/2025 18:32:47 |
| ENA expcon | nucleotide | 1 | 243,044 | 20/03/2025 09:07:29 |
| ENA ncr | nucleotide | 64 | 50,147,132 | 27/03/2025 01:48:01 |
| ENA rrna | nucleotide | 38 | 4,340,725 | 23/08/2023 00:37:29 |
| Ens | mixed | 14,766 | 409,062,290 | 03/03/2025 16:23:49 |
| EnsCovid | mixed | 4 | 26 | 20/02/2025 00:36:05 |
| EnsGenomes | mixed | 167,863 | 377,763,767 | 17/10/2024 10:51:38 |
| EPO | protein | 1 | 6,516,768 | 01/04/2025 01:03:09 |
| HMMER3 | protein | 7 | 207,034 | 16/06/2022 10:42:07 |
| IMGTHLAcds | nucleotide | 1 | 42,017 | 26/02/2025 08:13:51 |
| IMGTHLAgen | nucleotide | 1 | 23,373 | 26/02/2025 08:14:09 |
| IMGTHLApro | protein | 1 | 41,804 | 26/02/2025 08:13:38 |
| IMGTLIGM | nucleotide | 1 | 246,842 | 22/07/2024 11:23:42 |
| IntAct | protein | 1 | 124,910 | 10/12/2024 00:25:17 |
| InterPro | protein | 1 | 47,677 | 07/02/2025 01:50:05 |
| IPDKIRcds | nucleotide | 1 | 1,534 | 22/07/2024 11:21:13 |
| IPDKIRgen | nucleotide | 1 | 880 | 22/07/2024 11:22:11 |
| IPDKIRpro | protein | 1 | 1,387 | 22/07/2024 11:22:15 |
| IPDMHCcds | nucleotide | 1 | 11,506 | 22/07/2024 11:20:11 |
| IPDMHCgen | nucleotide | 1 | 3,008 | 22/07/2024 11:21:43 |
| IPDMHCpro | protein | 1 | 11,506 | 22/07/2024 11:22:42 |
| IPDNHKIRcds | nucleotide | 1 | 1,072 | 22/07/2024 11:22:11 |
| IPDNHKIRgen | nucleotide | 1 | 13 | 22/07/2024 11:22:11 |
| IPDNHKIRpro | protein | 1 | 1,072 | 22/07/2024 11:23:12 |
| IPRMC | protein | 1 | 253,273,355 | 07/02/2025 20:07:27 |
| IPRMC_UNIPARC | protein | 1 | 1 | 19/02/2025 11:06:52 |
| JPO | protein | 1 | 7,647,120 | 29/01/2025 01:33:16 |
| KIPO | protein | 1 | 2,678,427 | 29/01/2025 00:59:08 |
| MP | protein | 1 | 1,228,767 | 22/07/2024 11:18:39 |
| MPEP | protein | 1 | 1,228,278 | 22/07/2024 11:18:07 |
| MPRO | protein | 1 | 5,098 | 22/07/2024 11:17:40 |
| PANTHER | protein | 1 | 123,151 | 10/11/2023 01:10:31 |
| Patent Equivalents | protein | 1 | 119,710 | 03/04/2023 14:31:12 |
| PDB | protein | 1 | 859,488 | 03/04/2025 00:50:08 |
| PDBaa | protein | 1 | 859,488 | 03/04/2025 00:50:08 |
| PDBna | nucleotide | 1 | 54,283 | 03/04/2025 00:48:55 |
| Pfam | protein | 1 | 24,076 | 20/03/2025 08:51:52 |
| Rfam | nucleotide | 1 | 4,178 | 19/09/2024 01:03:21 |
| TAXONOMY | other | 1 | 1 | 03/04/2025 00:48:22 |
| TreeFam | protein | 1 | 15,736 | 10/11/2023 00:48:42 |
| UniParc | protein | 400 | 1,833,742,494 | 18/02/2025 09:22:02 |
| UniProtKB | protein | 3 | 10,696,642 | 17/02/2025 10:52:31 |
| UniProtKB Divisions | protein | 18 | 175,172,742 | 17/02/2025 10:52:48 |
| UniRef | protein | 3 | 30,000,000 | 17/02/2025 10:52:54 |
| UniVec | nucleotide | 1 | 6,111 | 24/08/2024 00:16:42 |
| USPTO | protein | 1 | 10,032,088 | 26/01/2025 01:25:07 |
| WormBase | mixed | 1,120 | 22,358,127 | 24/04/2024 15:23:51 |