Dataset updates for February 2024
The latest dataset releases are available for sequence searching in our Sequence Similarity Search bioinformatics applications and for entry retrieval through Dbfetch.
Key updates
Overview of the current biological databases
The current dataset composition, as of 7th March 2024, is shown below. Dataset composition can also be browsed on the JD Data Statistics page.
| Name | Seq. Type | No. Datasets | No. Entries | Last Updated (DD/MM/YYYY hh:mm:ss) |
|---|---|---|---|---|
| AFDB | protein | 1 | 214,684,312 | 28/07/2022 12:12:03 |
| CDP | nucleotide | 1 | 200,030 | 27/06/2022 16:42:58 |
| ChEMBL | protein | 1 | 14,013 | 10/06/2023 00:19:35 |
| EMVec | nucleotide | 1 | 7,377 | 23/02/2024 05:19:42 |
| ENA | nucleotide | 110 | 68,137,539 | 06/03/2024 21:01:53 |
| ENA cds | nucleotide | 82 | 172,568,001 | 23/02/2024 16:20:06 |
| ENA expcon | nucleotide | 1 | 243,041 | 23/02/2024 05:29:23 |
| ENA ncr | nucleotide | 57 | 23,352,584 | 29/02/2024 01:34:48 |
| ENA rrna | nucleotide | 38 | 4,340,725 | 23/08/2023 00:37:29 |
| ENA spacer | nucleotide | 45 | 212,565 | 01/07/2023 02:49:57 |
| Ens | mixed | 964 | 77,361,235 | 04/12/2023 14:45:49 |
| EnsCovid | mixed | 4 | 26 | 31/12/2023 00:24:02 |
| EnsGenomes | mixed | 166,714 | 361,700,632 | 17/07/2023 16:20:02 |
| EPO | protein | 1 | 4,441,107 | 23/05/2023 17:05:48 |
| HMMER3 | protein | 7 | 207,034 | 16/06/2022 10:42:07 |
| IMGTHLAcds | nucleotide | 1 | 38,904 | 15/02/2024 12:35:38 |
| IMGTHLAgen | nucleotide | 1 | 21,139 | 17/01/2024 00:27:15 |
| IMGTHLApro | protein | 1 | 38,704 | 15/02/2024 12:34:42 |
| IMGTLIGM | nucleotide | 1 | 246,842 | 25/06/2023 00:11:52 |
| IntAct | protein | 1 | 123,032 | 20/02/2024 00:33:29 |
| InterPro | protein | 1 | 40,768 | 26/01/2024 01:17:31 |
| IPDKIRcds | nucleotide | 1 | 1,534 | 16/12/2023 00:22:12 |
| IPDKIRgen | nucleotide | 1 | 880 | 16/12/2023 00:20:54 |
| IPDKIRpro | protein | 1 | 1,387 | 16/12/2023 00:21:05 |
| IPDMHCcds | nucleotide | 1 | 11,519 | 23/08/2023 00:20:29 |
| IPDMHCgen | nucleotide | 1 | 3,008 | 23/08/2023 00:20:29 |
| IPDMHCpro | protein | 1 | 11,519 | 16/06/2022 10:00:12 |
| IPDNHKIRcds | nucleotide | 1 | 1,072 | 16/01/2023 23:29:48 |
| IPDNHKIRgen | nucleotide | 1 | 13 | 16/01/2023 23:22:40 |
| IPDNHKIRpro | protein | 1 | 1,072 | 16/01/2023 23:22:40 |
| IPRMC | protein | 1 | 250,389,714 | 26/01/2024 15:15:42 |
| IPRMC_UNIPARC | protein | 1 | 1 | 29/01/2024 17:31:22 |
| JPO | protein | 1 | 5,791,725 | 27/02/2024 01:35:44 |
| KIPO | protein | 1 | 2,087,869 | 27/02/2024 01:17:22 |
| MP | protein | 1 | 1,228,767 | 16/06/2022 10:06:17 |
| MPEP | protein | 1 | 1,228,278 | 16/06/2022 10:00:29 |
| MPRO | protein | 1 | 5,098 | 07/01/2023 23:47:07 |
| PANTHER | protein | 1 | 123,151 | 10/11/2023 01:10:31 |
| Patent Equivalents | protein | 1 | 119,710 | 03/04/2023 14:31:12 |
| PDB | protein | 1 | 766,864 | 07/03/2024 00:27:40 |
| PDBaa | protein | 1 | 766,864 | 07/03/2024 00:27:48 |
| PDBna | nucleotide | 1 | 48,658 | 07/03/2024 00:29:05 |
| Pfam | protein | 1 | 20,795 | 15/09/2023 01:30:47 |
| Rfam | nucleotide | 1 | 4,170 | 15/11/2023 12:48:45 |
| TAXONOMY | other | 1 | 1 | 06/03/2024 00:33:09 |
| TESTDB | protein | 1 | 3 | |
| TreeFam | protein | 1 | 15,736 | 10/11/2023 00:48:42 |
| UniParc | protein | 401 | 1,329,362,100 | 30/01/2024 13:57:44 |
| UniProtKB | protein | 3 | 12,694,145 | 30/01/2024 13:58:05 |
| UniProtKB Divisions | protein | 18 | 153,919,484 | 30/01/2024 13:58:42 |
| UniRef | protein | 3 | 90,514,904 | 30/01/2024 13:57:27 |
| UniVec | nucleotide | 1 | 6,113 | 26/01/2024 14:52:53 |
| USPTO | protein | 1 | 9,297,014 | 14/12/2023 01:29:21 |
| WormBase | mixed | 1,120 | 22,358,127 | 13/04/2023 12:45:13 |
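For readers who want to work with these figures programmatically, the following is a minimal sketch of how rows from the table above could be parsed. It assumes the rows are pasted as pipe-delimited text, that entry counts use commas as thousands separators, and that timestamps are day-first (values such as `28/07/2022` in the table suggest this). The names `TABLE` and `parse_rows` are illustrative only and not part of any service described here.

```python
from datetime import datetime

# A handful of rows copied from the table above; paste the full table to analyse everything.
TABLE = """\
| Name | Seq. Type | No. Datasets | No. Entries | Last Updated |
|---|---|---|---|---|
| EMVec | nucleotide | 1 | 7,377 | 23/02/2024 05:19:42 |
| ENA | nucleotide | 110 | 68,137,539 | 06/03/2024 21:01:53 |
| PDB | protein | 1 | 766,864 | 07/03/2024 00:27:40 |
| TESTDB | protein | 1 | 3 | |
| UniParc | protein | 401 | 1,329,362,100 | 30/01/2024 13:57:44 |
"""


def parse_rows(table_text):
    """Turn pipe-delimited table text into a list of dicts (hypothetical helper)."""
    rows = []
    # Skip the header line and the |---| separator line.
    for line in table_text.splitlines()[2:]:
        line = line.strip()
        if not line:
            continue
        # Drop the outer pipes, then split the remaining text into cells.
        name, seq_type, datasets, entries, updated = (
            cell.strip() for cell in line[1:-1].split("|")
        )
        rows.append({
            "name": name,
            "seq_type": seq_type,
            "datasets": int(datasets.replace(",", "")),
            "entries": int(entries.replace(",", "")),
            # Assumption: day-first timestamps; an empty cell becomes None.
            "updated": datetime.strptime(updated, "%d/%m/%Y %H:%M:%S") if updated else None,
        })
    return rows


rows = parse_rows(TABLE)

# Example: names of datasets refreshed on or after 1 February 2024.
recent = [r["name"] for r in rows if r["updated"] and r["updated"] >= datetime(2024, 2, 1)]
print(recent)

# Example: total protein entries across the sampled rows.
print(sum(r["entries"] for r in rows if r["seq_type"] == "protein"))
```

Parsing the timestamp with an explicit format string, rather than a locale-dependent parser, avoids misreading dates such as `06/03/2024` as 3rd June.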