Dbfetch Databases
Introduction
The databases available via dbfetch are listed below, the name in parenthesis should be used when:
- Constructing a Dbfetch URL (see Syntax).
- Constructing an identifier file for upload (see Search Items).
- Making a request via the web services (see WSDbfetch).
An overview of each database is also provided, which includes a short description of the database, a link to the database, a collection of example identifiers and details of the available data formats and result styles.
Databases
- AlphaFold DB (afdb)
- CDP (cdp)
- ChEMBL Targets (chembl)
- EDAM (edam)
- EMDB (emdb)
- ENA Coding (ena_coding)
- ENA Geospatial (ena_geospatial)
- ENA Non-coding (ena_noncoding)
- ENA rRNA (ena_rrna)
- ENA Sequence (ena_sequence)
- ENA Sequence Constructed (ena_sequence_con)
- ENA Sequence Constructed Expanded (ena_sequence_conexp)
- ENA/SVA (ena_sva)
- Ensembl Gene (ensemblgene)
- Ensembl Genomes Gene (ensemblgenomesgene)
- Ensembl Genomes Transcript (ensemblgenomestranscript)
- Ensembl Transcript (ensembltranscript)
- EPO Proteins (epo_prt)
- HGNC (hgnc)
- IMGT/HLA (nucleotide cds) (imgthlacds)
- IMGT/HLA (nucleotide genomic) (imgthlagen)
- IMGT/HLA (protein) (imgthlapro)
- IMGT/LIGM-DB (imgtligm)
- InterPro (interpro)
- IPD-KIR (nucleotide cds) (ipdkircds)
- IPD-KIR (nucleotide genomic) (ipdkirgen)
- IPD-KIR (protein) (ipdkirpro)
- IPD-MHC (nucleotide cds) (ipdmhccds)
- IPD-MHC (nucleotide genomic) (ipdmhcgen)
- IPD-MHC (protein) (ipdmhcpro)
- IPD-NHKIR (nucleotide cds) (ipdnhkircds)
- IPD-NHKIR (nucleotide genomic) (ipdnhkirgen)
- IPD-NHKIR (protein) (ipdnhkirpro)
- IPRMC (iprmc)
- IPRMC UniParc (iprmcuniparc)
- JPO Proteins (jpo_prt)
- KIPO Proteins (kipo_prt)
- MEDLINE (medline)
- MEROPS-MP (mp)
- MEROPS-MPEP (mpep)
- MEROPS-MPRO (mpro)
- Patent DNA NRL1 (nrnl1)
- Patent DNA NRL2 (nrnl2)
- Patent Protein NRL1 (nrpl1)
- Patent Protein NRL2 (nrpl2)
- Patent Equivalents (patent_equivalents)
- PDB (pdb)
- PDBe-KB (pdbekb)
- RefSeq (nucleotide) (refseqn)
- RefSeq (protein) (refseqp)
- Taxonomy (taxonomy)
- UniParc (uniparc)
- UniProtKB (uniprotkb)
- UniRef100 (uniref100)
- UniRef50 (uniref50)
- UniRef90 (uniref90)
- UniSave (unisave)
- USPTO Proteins (uspto_prt)
AlphaFold DB (afdb)
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
AF-P35858-F1, P35858, AF-P35859-F1, P35859
|
| json | default, html, raw |
Accession:
AF-P35858-F1, P35858, AF-P35859-F1, P35859
|
| mmcif | default, html, raw |
Accession:
AF-P35858-F1, P35858, AF-P35859-F1, P35859
|
| pdb | default, html, raw |
Accession:
AF-P35858-F1, P35858, AF-P35859-F1, P35859
|
| fasta | default, html, raw |
Accession:
AF-P35858-F1, P35858, AF-P35859-F1, P35859
|
Data resources: AlphaFold, AlphaFold_UniProtIDs, EMBOSS seqret, NCBI BLAST blastdbcmd, UniProt.org
CDP (cdp)
COVID-19 DATA PORTAL ENA Consensus Sequences
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
ERR7457558, ERR7457272, ERR7370701, ERR7384326, ERR7457401
|
| cdpxml | default, raw |
Accession:
ERR7457558, ERR7457272, ERR7370701, ERR7384326, ERR7457401
|
| fasta | default, html, raw |
Accession:
ERR7457558, ERR7457272, ERR7370701, ERR7384326, ERR7457401
|
Data resources: ENA Browser, EMBOSS seqret, NCBI BLAST blastdbcmd
ChEMBL Targets (chembl)
ChEMBL (Sequences from a manually curated database of bioactive molecules with drug-like properties)
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
CHEMBL3038458_P02708
|
| fasta | default, html, raw |
Accession:
CHEMBL3038458_P02708
|
Data resources: NCBI BLAST blastdbcmd, EMBOSS seqret
EDAM (edam)
http://edamontology.sourceforge.net/
EMBRACE Data and Methods (EDAM) Ontology.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
0338, 1929, EDAM_operation:0338, EDAM_format:1929, 0000338, 0001929
|
| obo | default, html, raw |
Id:
0338, 1929, EDAM_operation:0338, EDAM_format:1929, 0000338, 0001929
|
Data resources: EMBOSS ontoget, EMBOSS ontotext
EMDB (emdb)
https://www.ebi.ac.uk/pdbe/emdb/
The Electron Microscopy Data Bank (EMDB) is a public repository for electron microscopy density maps of macromolecular complexes and subcellular structures.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, raw |
Id:
EMD-0439, EMD-0979, EMD-10515, EMD-10658
|
| map | default, raw |
Id:
EMD-0439, EMD-0979, EMD-10515, EMD-10658
|
| bundle | default, raw |
Id:
EMD-0439, EMD-0979, EMD-10515, EMD-10658
|
| bundlezip | default, raw |
Id:
EMD-0439, EMD-0979, EMD-10515, EMD-10658
|
| xml | default, html, raw |
Id:
EMD-0439, EMD-0979, EMD-10515, EMD-10658
|
Data resources: EMDB FTP @EMBL-EBI, EMDB/PDBe @EMBL-EBI
ENA Coding (ena_coding)
ENA Coding is a database of nucleotide sequences of the CDS (coding sequence) features, as annotated in ENA Sequence database. ENA Coding records contain the nucleotide sequence of the CDS region with accompanying annotation from the parent nucleotide entry and the additional automatically generated annotation.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
AAA59452
Sequence version: AAA59452.1 |
| annot | default, html, raw |
Accession:
AAA59452
Sequence version: AAA59452.1 |
| embl | default, html, raw |
Accession:
AAA59452
Sequence version: AAA59452.1 |
| emblxml-1.1 | default, raw |
Accession:
AAA59452
Sequence version: AAA59452.1 |
| entrysize | default, html, raw |
Accession:
AAA59452
Sequence version: AAA59452.1 |
| fasta | default, html, raw |
Accession:
AAA59452
Sequence version: AAA59452.1 |
| seqxml | default, raw |
Accession:
AAA59452
Sequence version: AAA59452.1 |
Data resources: ENA Browser, NCBI BLAST blastdbcmd
ENA Geospatial (ena_geospatial)
ENA Geospatial is a database of nucleotide sequences of the ENA Geospatial Sequence.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
ABBY01000001, KR025151
|
| annot | default, html, raw |
Id:
ABBY01000001, KR025151
|
| embl | default, html, raw |
Id:
ABBY01000001, KR025151
|
| entrysize | default, html, raw |
Id:
ABBY01000001, KR025151
|
| fasta | default, html, raw |
Id:
ABBY01000001, KR025151
|
| seqxml | default, raw |
Id:
ABBY01000001, KR025151
|
Data resources: ENA Browser, ENA Browser
ENA Non-coding (ena_noncoding)
ENA Non-coding is a database of nucleotide sequences of the non-coding RNA features, as annotated in ENA Sequence database. ENA Non-coding records contain the nucleotide sequence of the RNA feature with accompanying annotation from the parent nucleotide entry and the additional automatically generated annotation.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
AB012758.1:1..40:tRNA
|
| annot | default, html, raw |
Id:
AB012758.1:1..40:tRNA
|
| embl | default, html, raw |
Id:
AB012758.1:1..40:tRNA
|
| entrysize | default, html, raw |
Id:
AB012758.1:1..40:tRNA
|
| fasta | default, html, raw |
Id:
AB012758.1:1..40:tRNA
|
| seqxml | default, raw |
Id:
AB012758.1:1..40:tRNA
|
Data resources: ENA Browser, NCBI BLAST blastdbcmd
ENA rRNA (ena_rrna)
ENA rRNA is a database of nucleotide sequences of the ribosomal sequences, as annotated in ENA Sequence database. ENA rRNA records contain the nucleotide sequence of the CDS region with accompanying annotation from the parent nucleotide entry and the additional automatically generated annotation.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
FJ198229.1:1212..1484:rRNA
|
| annot | default, html, raw |
Id:
FJ198229.1:1212..1484:rRNA
|
| embl | default, html, raw |
Id:
FJ198229.1:1212..1484:rRNA
|
| entrysize | default, html, raw |
Id:
FJ198229.1:1212..1484:rRNA
|
| fasta | default, html, raw |
Id:
FJ198229.1:1212..1484:rRNA
|
| seqxml | default, raw |
Id:
FJ198229.1:1212..1484:rRNA
|
Data resources: ENA Browser, NCBI BLAST blastdbcmd
ENA Sequence (ena_sequence)
ENA Sequence (formerly known as EMBL-Bank), Europe's primary nucleotide sequence resource. The main sources of the DNA and RNA sequences in the database are submissions from individual researchers, genome sequencing projects and patent applications.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
M10051, K00650, D87894, AJ242600
Sequence version: J00231.1, K00650.1, D87894.1, AJ242600.1 |
| annot | default, html, raw |
Accession:
M10051, K00650, D87894, AJ242600
Sequence version: J00231.1, K00650.1, D87894.1, AJ242600.1 |
| embl | default, html, raw |
Accession:
M10051, K00650, D87894, AJ242600
Sequence version: J00231.1, K00650.1, D87894.1, AJ242600.1 |
| emblxml-1.1 | default, raw |
Accession:
M10051, K00650, D87894, AJ242600
Sequence version: J00231.1, K00650.1, D87894.1, AJ242600.1 |
| entrysize | default, html, raw |
Accession:
M10051, K00650, D87894, AJ242600
Sequence version: J00231.1, K00650.1, D87894.1, AJ242600.1 |
| fasta | default, html, raw |
Accession:
M10051, K00650, D87894, AJ242600
Sequence version: J00231.1, K00650.1, D87894.1, AJ242600.1 |
| insdxml | default, raw |
Accession:
M10051, K00650, D87894, AJ242600
Sequence version: J00231.1, K00650.1, D87894.1, AJ242600.1 |
| seqxml | default, raw |
Accession:
M10051, K00650, D87894, AJ242600
Sequence version: J00231.1, K00650.1, D87894.1, AJ242600.1 |
Data resources: ENA Browser, ENA/SVA, NCBI BLAST blastdbcmd
ENA Sequence Constructed (ena_sequence_con)
The ENA Sequence Constructed database division represents complete genomes and other long sequences constructed from segment entries. Instead of containing the sequence, these entries detail how to assemble the sequence from other ENA Sequence entries.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
CH003588
Sequence version: CH003588.1 |
| annot | default, html, raw |
Accession:
CH003588
Sequence version: CH003588.1 |
| embl | default, html, raw |
Accession:
CH003588
Sequence version: CH003588.1 |
| emblxml-1.1 | default, raw |
Accession:
CH003588
Sequence version: CH003588.1 |
| entrysize | default, html, raw |
Accession:
CH003588
Sequence version: CH003588.1 |
| fasta | default, html, raw |
Accession:
CH003588
Sequence version: CH003588.1 |
| insdxml | default, raw |
Accession:
CH003588
Sequence version: CH003588.1 |
| seqxml | default, raw |
Accession:
CH003588
Sequence version: CH003588.1 |
Data resources: ENA Browser, ENA/SVA
ENA Sequence Constructed Expanded (ena_sequence_conexp)
The ENA Sequence Constructed database division represents complete genomes and other long sequences constructed from segment entries. Expanded entries include the complete nucleotide sequence for the constructed entry.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
AL672111
Sequence version: AL672111.1 |
| annot | default, html, raw |
Accession:
AL672111
Sequence version: AL672111.1 |
| embl | default, html, raw |
Accession:
AL672111
Sequence version: AL672111.1 |
| emblxml-1.1 | default, raw |
Accession:
AL672111
Sequence version: AL672111.1 |
| entrysize | default, html, raw |
Accession:
AL672111
Sequence version: AL672111.1 |
| fasta | default, html, raw |
Accession:
AL672111
Sequence version: AL672111.1 |
| insdxml | default, raw |
Accession:
AL672111
Sequence version: AL672111.1 |
| seqxml | default, raw |
Accession:
AL672111
Sequence version: AL672111.1 |
Data resources: ENA Browser
ENA/SVA (ena_sva)
https://www.ebi.ac.uk/cgi-bin/sva/sva.pl
The ENA Sequence Version Archive is a repository of all entries which have ever appeared in the EMBL Nucleotide Sequence Databank (EMBL-Bank) or ENA Sequence databases.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
Y09633
Sequence version: Y09633.1, Y09633.4 |
| annot | default, html, raw |
Accession:
Y09633
Sequence version: Y09633.1, Y09633.4 |
| embl | default, html, raw |
Accession:
Y09633
Sequence version: Y09633.1, Y09633.4 |
| emblxml-1.1 | default, raw |
Accession:
Y09633
Sequence version: Y09633.1, Y09633.4 |
| entrysize | default, html, raw |
Accession:
Y09633
Sequence version: Y09633.1, Y09633.4 |
| fasta | default, html, raw |
Accession:
Y09633
Sequence version: Y09633.1, Y09633.4 |
| insdxml | default, raw |
Accession:
Y09633
Sequence version: Y09633.1, Y09633.4 |
| seqxml | default, raw |
Accession:
Y09633
Sequence version: Y09633.1, Y09633.4 |
Data resources: ENA/SVA
Ensembl Gene (ensemblgene)
Ensembl genome databases for vertebrate species and model organisms, for other species see Ensembl Genomes instead of Ensembl. Gene sequences and annotations.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, raw |
Id:
ENSBTAG00000000988, ENSG00000139618, ENSMUSG00000041147
|
| csv | default, raw |
Id:
ENSBTAG00000000988, ENSG00000139618, ENSMUSG00000041147
|
| embl | default, raw |
Id:
ENSBTAG00000000988, ENSG00000139618, ENSMUSG00000041147
|
| fasta | default, raw |
Id:
ENSBTAG00000000988, ENSG00000139618, ENSMUSG00000041147
|
| genbank | default, raw |
Id:
ENSBTAG00000000988, ENSG00000139618, ENSMUSG00000041147
|
| gff2 | default, raw |
Id:
ENSBTAG00000000988, ENSG00000139618, ENSMUSG00000041147
|
| gff3 | default, raw |
Id:
ENSBTAG00000000988, ENSG00000139618, ENSMUSG00000041147
|
| tab | default, raw |
Id:
ENSBTAG00000000988, ENSG00000139618, ENSMUSG00000041147
|
Data resources: Ensembl UK, Ensembl USA East, Ensembl USA West, Ensembl Asia
Ensembl Genomes Gene (ensemblgenomesgene)
http://www.ensemblgenomes.org/
Ensembl Genomes genome databases for metazoa, plants, fungi, protists and bacteria, for vertebrate species and model organisms see Ensembl instead of Ensembl Genomes. Gene sequences and annotations.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, raw |
Id:
Phatr3_J20657, MGG_01236, ANIA_02742, b2736
|
| csv | default, raw |
Id:
Phatr3_J20657, MGG_01236, ANIA_02742, b2736
|
| embl | default, raw |
Id:
Phatr3_J20657, MGG_01236, ANIA_02742, b2736
|
| fasta | default, raw |
Id:
Phatr3_J20657, MGG_01236, ANIA_02742, b2736
|
| genbank | default, raw |
Id:
Phatr3_J20657, MGG_01236, ANIA_02742, b2736
|
| gff2 | default, raw |
Id:
Phatr3_J20657, MGG_01236, ANIA_02742, b2736
|
| gff3 | default, raw |
Id:
Phatr3_J20657, MGG_01236, ANIA_02742, b2736
|
| tab | default, raw |
Id:
Phatr3_J20657, MGG_01236, ANIA_02742, b2736
|
Data resources: EnsemblGenomes UK
Ensembl Genomes Transcript (ensemblgenomestranscript)
http://www.ensemblgenomes.org/
Ensembl Genomes genome databases for metazoa, plants, fungi, protists and bacteria, for vertebrate species and model organisms see Ensembl instead of Ensembl Genomes. Transcript sequences.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, raw |
Id:
CSON008231-1, PITG_14371T0, YGR147C_mRNA, AAC75778
|
| fasta | default, raw |
Id:
CSON008231-1, PITG_14371T0, YGR147C_mRNA, AAC75778
|
Data resources: EnsemblGenomes UK
Ensembl Transcript (ensembltranscript)
Ensembl genome databases for vertebrate species and model organisms, for other species see Ensembl Genomes instead of Ensembl. Transcript sequences.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, raw |
Id:
ENSAMET00000013126, ENSBTAT00000001311, ENST00000380152, ENSMUST00000044620
|
| fasta | default, raw |
Id:
ENSAMET00000013126, ENSBTAT00000001311, ENST00000380152, ENSMUST00000044620
|
Data resources: Ensembl UK, Ensembl USA East, Ensembl USA West, Ensembl Asia
EPO Proteins (epo_prt)
https://www.ebi.ac.uk/patentdata/proteins/
Protein sequences appearing in patents from the European Patent Office (EPO).
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
A00022
Sequence version: A00022.1 |
| annot | default, html, raw |
Accession:
A00022
Sequence version: A00022.1 |
| embl | default, html, raw |
Accession:
A00022
Sequence version: A00022.1 |
| entrysize | default, html, raw |
Accession:
A00022
Sequence version: A00022.1 |
| fasta | default, html, raw |
Accession:
A00022
Sequence version: A00022.1 |
| seqxml | default, raw |
Accession:
A00022
Sequence version: A00022.1 |
Data resources: SimpleIndex, EMBOSS entret, NCBI BLAST blastdbcmd, EMBOSS seqret
HGNC (hgnc)
HUGO Gene Nomenclature Committee (HGNC) approved gene name and symbol (short-form abbreviation) for each human gene.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
1101, 3566
Id: BRCA2, FACD |
| tab | default, html, raw |
Accession:
1101, 3566
Id: BRCA2, FACD |
Data resources: GeneNames.org
IMGT/HLA (nucleotide cds) (imgthlacds)
https://www.ebi.ac.uk/imgt/hla/
Sequences of the human major histocompatibility complex (HLA) including the official sequences for the WHO Nomenclature Committee For Factors of the HLA System.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
HLA00001
Id: HLA-A*01:01:01:01 |
| annot | default, html, raw |
Accession:
HLA00001
Id: HLA-A*01:01:01:01 |
| embl | default, html, raw |
Accession:
HLA00001
Id: HLA-A*01:01:01:01 |
| entrysize | default, html, raw |
Accession:
HLA00001
Id: HLA-A*01:01:01:01 |
| seqxml | default, raw |
Accession:
HLA00001
Id: HLA-A*01:01:01:01 |
| fasta | default, html, raw |
Accession:
HLA00001
Id: HLA-A*01:01:01:01 |
Data resources: EMBOSS entret, EMBOSS seqret, NCBI BLAST blastdbcmd
IMGT/HLA (nucleotide genomic) (imgthlagen)
https://www.ebi.ac.uk/imgt/hla/
Sequences of the human major histocompatibility complex (HLA) including the official sequences for the WHO Nomenclature Committee For Factors of the HLA System.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
HLA00001
Id: HLA-A*01:01:01:01 |
| annot | default, html, raw |
Accession:
HLA00001
Id: HLA-A*01:01:01:01 |
| embl | default, html, raw |
Accession:
HLA00001
Id: HLA-A*01:01:01:01 |
| entrysize | default, html, raw |
Accession:
HLA00001
Id: HLA-A*01:01:01:01 |
| seqxml | default, raw |
Accession:
HLA00001
Id: HLA-A*01:01:01:01 |
| fasta | default, html, raw |
Accession:
HLA00001
Id: HLA-A*01:01:01:01 |
Data resources: EMBOSS entret, EMBOSS seqret, NCBI BLAST blastdbcmd
IMGT/HLA (protein) (imgthlapro)
https://www.ebi.ac.uk/imgt/hla/
Sequences of the human major histocompatibility complex (HLA) including the official sequences for the WHO Nomenclature Committee For Factors of the HLA System.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
HLA00001
|
| fasta | default, html, raw |
Accession:
HLA00001
|
Data resources: NCBI BLAST blastdbcmd, EMBOSS seqret
IMGT/LIGM-DB (imgtligm)
http://imgt.cines.fr/cgi-bin/IMGTlect.jv
A comprehensive database of Immunoglobulins and T cell Receptors from human and other vertebrates.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
A00673, AF003293
|
| annot | default, html, raw |
Accession:
A00673, AF003293
|
| embl | default, html, raw |
Accession:
A00673, AF003293
|
| entrysize | default, html, raw |
Accession:
A00673, AF003293
|
| seqxml | default, raw |
Accession:
A00673, AF003293
|
| fasta | default, html, raw |
Accession:
A00673, AF003293
|
Data resources: EMBOSS entret, EMBOSS seqret, NCBI BLAST blastdbcmd
InterPro (interpro)
https://www.ebi.ac.uk/interpro/
The InterPro database (Integrated Resource of Protein Domains and Functional Sites) is an integrated documentation resource for protein families, domains and functional sites. It was developed initially as a means of rationalising the complementary efforts of the PROSITE, PRINTS, Pfam and ProDom database projects, but now also includes the SMART, TIGRFAMs, PIR SuperFamilies and most recently SUPERFAMILY databases.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
IPR006212, IPR008266, IPR008958, IPR009030, IPR011009
|
| interpro | default, html, raw |
Id:
IPR006212, IPR008266, IPR008958, IPR009030, IPR011009
|
| interproxml | default, raw |
Id:
IPR006212, IPR008266, IPR008958, IPR009030, IPR011009
|
| tab | default, html, raw |
Id:
IPR006212, IPR008266, IPR008958, IPR009030, IPR011009
|
Data resources: SimpleIndex
IPD-KIR (nucleotide cds) (ipdkircds)
https://www.ebi.ac.uk/ipd/kir/
A centralised repository for human Killer-cell Immunoglobulin-like Receptor (KIR) sequences.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
KIR00001
Id: 3DL3*005 |
| annot | default, html, raw |
Accession:
KIR00001
Id: 3DL3*005 |
| embl | default, html, raw |
Accession:
KIR00001
Id: 3DL3*005 |
| entrysize | default, html, raw |
Accession:
KIR00001
Id: 3DL3*005 |
| seqxml | default, raw |
Accession:
KIR00001
Id: 3DL3*005 |
| fasta | default, html, raw |
Accession:
KIR00001
Id: 3DL3*005 |
Data resources: EMBOSS entret, EMBOSS seqret, NCBI BLAST blastdbcmd
IPD-KIR (nucleotide genomic) (ipdkirgen)
https://www.ebi.ac.uk/ipd/kir/
A centralised repository for human Killer-cell Immunoglobulin-like Receptor (KIR) sequences.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
KIR00001
Id: 3DL3*005 |
| annot | default, html, raw |
Accession:
KIR00001
Id: 3DL3*005 |
| embl | default, html, raw |
Accession:
KIR00001
Id: 3DL3*005 |
| entrysize | default, html, raw |
Accession:
KIR00001
Id: 3DL3*005 |
| seqxml | default, raw |
Accession:
KIR00001
Id: 3DL3*005 |
| fasta | default, html, raw |
Accession:
KIR00001
Id: 3DL3*005 |
Data resources: EMBOSS entret, EMBOSS seqret, NCBI BLAST blastdbcmd
IPD-KIR (protein) (ipdkirpro)
https://www.ebi.ac.uk/imgt/hla/
A centralised repository for human Killer-cell Immunoglobulin-like Receptor (KIR) sequences.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
KIR00001
|
| fasta | default, html, raw |
Accession:
KIR00001
|
Data resources: NCBI BLAST blastdbcmd, EMBOSS seqret
IPD-MHC (nucleotide cds) (ipdmhccds)
https://www.ebi.ac.uk/ipd/mhc/
Sequences of the the major histocompatibility complex in a number of species.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
FISH08286
Id: Sasa-UBA*2403 |
| annot | default, html, raw |
Accession:
FISH08286
Id: Sasa-UBA*2403 |
| embl | default, html, raw |
Accession:
FISH08286
Id: Sasa-UBA*2403 |
| entrysize | default, html, raw |
Accession:
FISH08286
Id: Sasa-UBA*2403 |
| seqxml | default, raw |
Accession:
FISH08286
Id: Sasa-UBA*2403 |
| fasta | default, html, raw |
Accession:
FISH08286
Id: Sasa-UBA*2403 |
Data resources: EMBOSS entret, EMBOSS seqret, NCBI BLAST blastdbcmd
IPD-MHC (nucleotide genomic) (ipdmhcgen)
https://www.ebi.ac.uk/ipd/mhc/
Sequences of the the major histocompatibility complex in a number of species.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
FISH08286
Id: Sasa-UBA*2403 |
| annot | default, html, raw |
Accession:
FISH08286
Id: Sasa-UBA*2403 |
| embl | default, html, raw |
Accession:
FISH08286
Id: Sasa-UBA*2403 |
| entrysize | default, html, raw |
Accession:
FISH08286
Id: Sasa-UBA*2403 |
| seqxml | default, raw |
Accession:
FISH08286
Id: Sasa-UBA*2403 |
| fasta | default, html, raw |
Accession:
FISH08286
Id: Sasa-UBA*2403 |
Data resources: EMBOSS entret, EMBOSS seqret, NCBI BLAST blastdbcmd
IPD-MHC (protein) (ipdmhcpro)
https://www.ebi.ac.uk/ipd/mhc/
Sequences of the the major histocompatibility complex in a number of species.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
FISH08286
|
| fasta | default, html, raw |
Accession:
FISH08286
|
Data resources: NCBI BLAST blastdbcmd, EMBOSS seqret
IPD-NHKIR (nucleotide cds) (ipdnhkircds)
https://www.ebi.ac.uk/ipd/nhkir/
A centralised repository for human Killer-cell Immunoglobulin-like Receptor (NHKIR) sequences.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
NHP00001
Id: Mamu-KIR3DL01*013 |
| annot | default, html, raw |
Accession:
NHP00001
Id: Mamu-KIR3DL01*013 |
| embl | default, html, raw |
Accession:
NHP00001
Id: Mamu-KIR3DL01*013 |
| entrysize | default, html, raw |
Accession:
NHP00001
Id: Mamu-KIR3DL01*013 |
| seqxml | default, raw |
Accession:
NHP00001
Id: Mamu-KIR3DL01*013 |
| fasta | default, html, raw |
Accession:
NHP00001
Id: Mamu-KIR3DL01*013 |
Data resources: EMBOSS entret, EMBOSS seqret, NCBI BLAST blastdbcmd
IPD-NHKIR (nucleotide genomic) (ipdnhkirgen)
https://www.ebi.ac.uk/ipd/nhkir/
A centralised repository for human Killer-cell Immunoglobulin-like Receptor (NHKIR) sequences.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
NHP00048
Id: Mamu-KIR1D*002 |
| annot | default, html, raw |
Accession:
NHP00048
Id: Mamu-KIR1D*002 |
| embl | default, html, raw |
Accession:
NHP00048
Id: Mamu-KIR1D*002 |
| entrysize | default, html, raw |
Accession:
NHP00048
Id: Mamu-KIR1D*002 |
| seqxml | default, raw |
Accession:
NHP00048
Id: Mamu-KIR1D*002 |
| fasta | default, html, raw |
Accession:
NHP00048
Id: Mamu-KIR1D*002 |
Data resources: EMBOSS entret, EMBOSS seqret, NCBI BLAST blastdbcmd
IPD-NHKIR (protein) (ipdnhkirpro)
https://www.ebi.ac.uk/imgt/hla/
A centralised repository for human Killer-cell Immunoglobulin-like Receptor (NHKIR) sequences.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
NHP00001
|
| fasta | default, html, raw |
Accession:
NHP00001
|
Data resources: NCBI BLAST blastdbcmd, EMBOSS seqret
IPRMC (iprmc)
https://www.ebi.ac.uk/interpro/
InterPro Matches Complete (IPRMC) for UniProtKB proteins.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
A0PGH6, A0A2V8
|
| gff2 | default, html, raw |
Id:
A0PGH6, A0A2V8
|
| iprmc | default, html, raw |
Id:
A0PGH6, A0A2V8
|
| iprmctab | default, html, raw |
Id:
A0PGH6, A0A2V8
|
| iprmcxml | default, raw |
Id:
A0PGH6, A0A2V8
|
Data resources: SimpleIndex, InterPro DAS
IPRMC UniParc (iprmcuniparc)
https://www.ebi.ac.uk/interpro/
InterPro Matches Complete (IPRMC) for UniParc proteins.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
UPI0000000001, UPI0000046364, UPI00001B3DCE, 28FE89850863372D
|
| gff2 | default, html, raw |
Id:
UPI0000000001, UPI0000046364, UPI00001B3DCE, 28FE89850863372D
|
| iprmc | default, html, raw |
Id:
UPI0000000001, UPI0000046364, UPI00001B3DCE, 28FE89850863372D
|
| iprmctab | default, html, raw |
Id:
UPI0000000001, UPI0000046364, UPI00001B3DCE, 28FE89850863372D
|
| iprmcxml | default, raw |
Id:
UPI0000000001, UPI0000046364, UPI00001B3DCE, 28FE89850863372D
|
Data resources: SimpleIndex, InterPro UniParc Matches DAS
JPO Proteins (jpo_prt)
https://www.ebi.ac.uk/patentdata/proteins/
Protein sequences appearing in patents from the Japanese Patent Office (JPO).
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
DL607837
|
| annot | default, html, raw |
Accession:
DL607837
|
| embl | default, html, raw |
Accession:
DL607837
|
| entrysize | default, html, raw |
Accession:
DL607837
|
| fasta | default, html, raw |
Accession:
DL607837
|
| seqxml | default, raw |
Accession:
DL607837
|
Data resources: EMBOSS entret, NCBI BLAST blastdbcmd, EMBOSS seqret, DDBJ getentry
KIPO Proteins (kipo_prt)
https://www.ebi.ac.uk/patentdata/proteins/
Protein sequences appearing in patents from the Korean Intellectual Property Office (KIPO).
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
DI500001
|
| annot | default, html, raw |
Accession:
DI500001
|
| embl | default, html, raw |
Accession:
DI500001
|
| entrysize | default, html, raw |
Accession:
DI500001
|
| fasta | default, html, raw |
Accession:
DI500001
|
| seqxml | default, raw |
Accession:
DI500001
|
Data resources: EMBOSS entret, NCBI BLAST blastdbcmd, EMBOSS seqret, DDBJ getentry
MEDLINE (medline)
http://www.nlm.nih.gov/pubs/factsheets/medline.html
MEDLINE contains bibliographic citations and author abstracts from more than 5,000 biomedical journals published in the United States and 70 other countries. The files contains over 19 million citations dating back to the mid-1940's, updated weekly.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
1, 2859121, 17567924
|
| medlinefull | default, html, raw |
Id:
1, 2859121, 17567924
|
| medlineref | default, html, raw |
Id:
1, 2859121, 17567924
|
| bibtex | default, raw |
Id:
1, 2859121, 17567924
|
| endnote | default, raw |
Id:
1, 2859121, 17567924
|
| isi | default, raw |
Id:
1, 2859121, 17567924
|
| modsxml | default, raw |
Id:
1, 2859121, 17567924
|
| pubmedxml | default, raw |
Id:
1, 2859121, 17567924
|
| ris | default, raw |
Id:
1, 2859121, 17567924
|
| wordbibxml | default, raw |
Id:
1, 2859121, 17567924
|
Data resources: Europe PMC, NCBI E-utilities
MEROPS-MP (mp)
MEROPS-MP (Sequences from the full MEROPS collection)
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
MER0000885
|
| fasta | default, html, raw |
Accession:
MER0000885
|
Data resources: NCBI BLAST blastdbcmd, EMBOSS seqret
MEROPS-MPEP (mpep)
MEROPS-MPEP (Sequences from the peptidase or inhibitor domain sequence only)
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
MER0000885
|
| fasta | default, html, raw |
Accession:
MER0000885
|
Data resources: NCBI BLAST blastdbcmd, EMBOSS seqret
MEROPS-MPRO (mpro)
MEROPS-MPRO (Sequences from the MEROPS scan dataset)
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
MER0000885
|
| fasta | default, html, raw |
Accession:
MER0000885
|
Data resources: NCBI BLAST blastdbcmd, EMBOSS seqret
Patent DNA NRL1 (nrnl1)
https://www.ebi.ac.uk/patentdata/nr/
Non-redundant patent nucleotides level-1. Nucleotide sequences from patents clustered by 100% sequence identity over whole length.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
NRN_DJ207917
|
| annot | default, html, raw |
Id:
NRN_DJ207917
|
| entrysize | default, html, raw |
Id:
NRN_DJ207917
|
| nrl1 | default, html, raw |
Id:
NRN_DJ207917
|
| seqxml | default, raw |
Id:
NRN_DJ207917
|
| fasta | default, html, raw |
Id:
NRN_DJ207917
|
Data resources: SimpleIndex, NCBI BLAST blastdbcmd
Patent DNA NRL2 (nrnl2)
https://www.ebi.ac.uk/patentdata/nr/
Non-redundant patent nucleotides level-2. Nucleotide sequences from patents clustered by patent family and then by 100% sequence identity over whole length.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
NRN006674C5
|
| annot | default, html, raw |
Id:
NRN006674C5
|
| entrysize | default, html, raw |
Id:
NRN006674C5
|
| nrl2 | default, html, raw |
Id:
NRN006674C5
|
| seqxml | default, raw |
Id:
NRN006674C5
|
| fasta | default, html, raw |
Id:
NRN006674C5
|
Data resources: SimpleIndex, NCBI BLAST blastdbcmd
Patent Protein NRL1 (nrpl1)
https://www.ebi.ac.uk/patentdata/nr/
Non-redundant patent proteins level-1. Protein sequences from patents clustered by 100% sequence identity over whole length.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
NRP_AX013047
|
| annot | default, html, raw |
Id:
NRP_AX013047
|
| entrysize | default, html, raw |
Id:
NRP_AX013047
|
| nrl1 | default, html, raw |
Id:
NRP_AX013047
|
| seqxml | default, raw |
Id:
NRP_AX013047
|
| fasta | default, html, raw |
Id:
NRP_AX013047
|
Data resources: SimpleIndex, NCBI BLAST blastdbcmd
Patent Protein NRL2 (nrpl2)
https://www.ebi.ac.uk/patentdata/nr/
Non-redundant patent proteins level-2. Protein sequences from patents clustered by patent family and then by 100% sequence identity over whole length.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
NRP00000001
|
| annot | default, html, raw |
Id:
NRP00000001
|
| entrysize | default, html, raw |
Id:
NRP00000001
|
| nrl2 | default, html, raw |
Id:
NRP00000001
|
| seqxml | default, raw |
Id:
NRP00000001
|
| fasta | default, html, raw |
Id:
NRP00000001
|
Data resources: SimpleIndex, NCBI BLAST blastdbcmd
Patent Equivalents (patent_equivalents)
https://www.ebi.ac.uk/patentdata/
Patent number equivalents (families) and patent classifications for patents containing sequence data. The patent equivalents are obtained from the patent numbers cited in the major sequence databases (e.g. EMBL-Bank and Patent Proteins), which are them expanded into a set of patent equivalents forming a WIPO Simple Patent Family.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
10517942
|
| patent_equivalents | default, html, raw |
Id:
10517942
|
Data resources: SimpleIndex
PDB (pdb)
Macromolecular structures from the Brookhaven Protein Data Bank (PDB). Contains protein and nucleotide structure and sequence data.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
101D, 1GAG, 10MH, 3E3Q, 3E3Q_A, 3E3Q_a, 3E3QA, 3E3Qa
|
| fasta | default, raw |
Id:
101D, 1GAG, 10MH, 3E3Q, 3E3Q_A, 3E3Q_a, 3E3QA, 3E3Qa
|
| annot | default, html, raw |
Id:
101D, 1GAG, 10MH, 3E3Q, 3E3Q_A, 3E3Q_a, 3E3QA, 3E3Qa
|
| mmcif | default, raw |
Id:
101D, 1GAG, 10MH, 3E3Q, 3E3Q_A, 3E3Q_a, 3E3QA, 3E3Qa
|
| pdb | default, html, raw |
Id:
101D, 1GAG, 10MH, 3E3Q, 3E3Q_A, 3E3Q_a, 3E3QA, 3E3Qa
|
| pdbml | default, raw |
Id:
101D, 1GAG, 10MH, 3E3Q, 3E3Q_A, 3E3Q_a, 3E3QA, 3E3Qa
|
Data resources: EMBOSS seqret, PDB FTP@EMBL-EBI, PDBe, PDBe entry-files pdb, PDBe entry-files cif, PDBe downloads cif, PDBe downloads updated cif, RCSB PDB, PDBj, PDBe FTP@EMBL-EBI, NCBI BLAST blastdbcmd
PDBe-KB (pdbekb)
https://www.ebi.ac.uk/pdbe/pdbe-kb
PDBe-KB is a collaborative effort between PDBe and a diverse group of bioinformatics resources and research teams.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
101D, 1GAG, 10MH, 3E3Q, 3E3Q_A, 3E3Q_a, 3E3QA, 3E3Qa
|
| fasta | default, raw |
Id:
101D, 1GAG, 10MH, 3E3Q, 3E3Q_A, 3E3Q_a, 3E3QA, 3E3Qa
|
| annot | default, html, raw |
Id:
101D, 1GAG, 10MH, 3E3Q, 3E3Q_A, 3E3Q_a, 3E3QA, 3E3Qa
|
| mmcif | default, raw |
Id:
101D, 1GAG, 10MH, 3E3Q, 3E3Q_A, 3E3Q_a, 3E3QA, 3E3Qa
|
| pdb | default, html, raw |
Id:
101D, 1GAG, 10MH, 3E3Q, 3E3Q_A, 3E3Q_a, 3E3QA, 3E3Qa
|
| pdbml | default, raw |
Id:
101D, 1GAG, 10MH, 3E3Q, 3E3Q_A, 3E3Q_a, 3E3QA, 3E3Qa
|
Data resources: EMBOSS seqret, PDB FTP@EMBL-EBI, PDBe, PDBe entry-files pdb, PDBe entry-files cif, PDBe downloads cif, PDBe downloads updated cif, RCSB PDB, PDBj, PDBe FTP@EMBL-EBI, NCBI BLAST blastdbcmd
RefSeq (nucleotide) (refseqn)
https://www.ncbi.nlm.nih.gov/refseq/
The NCBI Reference Sequence project (RefSeq) provides reference sequence standards for the naturally occurring molecules of the central dogma, from chromosomes to mRNAs to proteins.
Data resources: NCBI E-utilities
RefSeq (protein) (refseqp)
https://www.ncbi.nlm.nih.gov/refseq/
The NCBI Reference Sequence project (RefSeq) provides reference sequence standards for the naturally occurring molecules of the central dogma, from chromosomes to mRNAs to proteins.
Data resources: NCBI E-utilities
Taxonomy (taxonomy)
https://www.ncbi.nlm.nih.gov/Taxonomy/
Taxonomic classification of organisms for which there are sequences in the INSDC databases (i.e. DDBJ, EMBL-Bank and GenBank) and many other biological databases.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Id:
3702, 9606
|
| taxonomy | default, html, raw |
Id:
3702, 9606
|
| enataxonomyxml | default, raw |
Id:
3702, 9606
|
| uniprottaxonomyrdfxml | default, raw |
Id:
3702, 9606
|
Data resources: ENA Browser, UniProt.org, EMBOSS taxget
UniParc (uniparc)
The UniProt Archive (UniParc) contains available protein sequences collected from many different sources. The sequence data are archived to facilitate examination of changes to sequence data. Search UniParc if you want to examine the "history" of a particular sequence.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, raw |
Accession:
UPI0000000001, UPI0000046364, UPI00001B3DCE
|
| fasta | default, raw |
Accession:
UPI0000000001, UPI0000046364, UPI00001B3DCE
|
| seqxml | default, raw |
Accession:
UPI0000000001, UPI0000046364, UPI00001B3DCE
|
| uniparc | default, raw |
Accession:
UPI0000000001, UPI0000046364, UPI00001B3DCE
|
| uniprotrdfxml | default, raw |
Accession:
UPI0000000001, UPI0000046364, UPI00001B3DCE
|
Data resources: UniProt.org, rest.uniprot.org, NCBI BLAST blastdbcmd, EMBOSS seqret
UniProtKB (uniprotkb)
The UniProt Knowledgebase (UniProtKB) is the central access point for extensive curated protein information, including function, classification, and cross-references. Search UniProtKB to retrieve “everything that is known” about a particular sequence.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
P01174, P29306, P68255
Name: WAP_RAT, 1433X_MAIZE, 1433T_RAT Sequence version: P06213.277, P05067.301 |
| fasta | default, html, raw |
Accession:
P01174, P29306, P68255
Name: WAP_RAT, 1433X_MAIZE, 1433T_RAT Sequence version: P06213.277, P05067.301 |
| annot | default, html, raw |
Accession:
P01174, P29306, P68255
Name: WAP_RAT, 1433X_MAIZE, 1433T_RAT Sequence version: P06213.277, P05067.301 |
| entrysize | default, html, raw |
Accession:
P01174, P29306, P68255
Name: WAP_RAT, 1433X_MAIZE, 1433T_RAT Sequence version: P06213.277, P05067.301 |
| gff3 | default, html, raw |
Accession:
P01174, P29306, P68255
Name: WAP_RAT, 1433X_MAIZE, 1433T_RAT Sequence version: P06213.277, P05067.301 |
| seqxml | default, raw |
Accession:
P01174, P29306, P68255
Name: WAP_RAT, 1433X_MAIZE, 1433T_RAT Sequence version: P06213.277, P05067.301 |
| uniprot | default, html, raw |
Accession:
P01174, P29306, P68255
Name: WAP_RAT, 1433X_MAIZE, 1433T_RAT Sequence version: P06213.277, P05067.301 |
| uniprotrdfxml | default, raw |
Accession:
P01174, P29306, P68255
Name: WAP_RAT, 1433X_MAIZE, 1433T_RAT Sequence version: P06213.277, P05067.301 |
| uniprotxml | default, raw |
Accession:
P01174, P29306, P68255
Name: WAP_RAT, 1433X_MAIZE, 1433T_RAT Sequence version: P06213.277, P05067.301 |
Data resources: UniProt.org, NCBI BLAST blastdbcmd, EMBOSS entret
UniRef100 (uniref100)
The UniProt Reference Clusters (UniRef) databases combine closely related sequences into a single record to speed searches. There are three different non-redundant databases with different sequence identity cut-offs. In UniRef100, UniRef90 and UniRef50 databases no pair of sequences in the representative set has >100%, >90% or >50% mutual sequence identity. The three UniRef databases allow the user to choose between a fast search and a truly comprehensive one.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, raw |
Id:
UniRef100_P01173
|
| fasta | default, raw |
Id:
UniRef100_P01173
|
| seqxml | default, raw |
Id:
UniRef100_P01173
|
| uniprotrdfxml | default, raw |
Id:
UniRef100_P01173
|
| uniref100 | default, raw |
Id:
UniRef100_P01173
|
Data resources: UniProt.org, rest.uniprot.org, NCBI BLAST blastdbcmd
UniRef50 (uniref50)
The UniProt Reference Clusters (UniRef) databases combine closely related sequences into a single record to speed searches. There are three different non-redundant databases with different sequence identity cut-offs. In UniRef100, UniRef90 and UniRef50 databases no pair of sequences in the representative set has >100%, >90% or >50% mutual sequence identity. The three UniRef databases allow the user to choose between a fast search and a truly comprehensive one.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, raw |
Id:
UniRef50_P01174
|
| fasta | default, raw |
Id:
UniRef50_P01174
|
| seqxml | default, raw |
Id:
UniRef50_P01174
|
| uniprotrdfxml | default, raw |
Id:
UniRef50_P01174
|
| uniref50 | default, raw |
Id:
UniRef50_P01174
|
Data resources: UniProt.org, rest.uniprot.org, NCBI BLAST blastdbcmd
UniRef90 (uniref90)
The UniProt Reference Clusters (UniRef) databases combine closely related sequences into a single record to speed searches. There are three different non-redundant databases with different sequence identity cut-offs. In UniRef100, UniRef90 and UniRef50 databases no pair of sequences in the representative set has >100%, >90% or >50% mutual sequence identity. The three UniRef databases allow the user to choose between a fast search and a truly comprehensive one.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, raw |
Id:
UniRef90_P01173
|
| fasta | default, raw |
Id:
UniRef90_P01173
|
| seqxml | default, raw |
Id:
UniRef90_P01173
|
| uniprotrdfxml | default, raw |
Id:
UniRef90_P01173
|
| uniref90 | default, raw |
Id:
UniRef90_P01173
|
Data resources: UniProt.org, rest.uniprot.org, NCBI BLAST blastdbcmd
UniSave (unisave)
https://www.ebi.ac.uk/uniprot/unisave/
The UniProtKB Sequence/Annotation Version Archive (UniSave) is a repository of UniProtKB/Swiss-Prot and UniProtKB/TrEMBL entry versions.
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, raw |
Accession:
P01174
Entry version: P01174.120, P01174.3 |
| annot | default, raw |
Accession:
P01174
Entry version: P01174.120, P01174.3 |
| entrysize | default, html, raw |
Accession:
P01174
Entry version: P01174.120, P01174.3 |
| fasta | default, raw |
Accession:
P01174
Entry version: P01174.120, P01174.3 |
| uniprot | default, raw |
Accession:
P01174
Entry version: P01174.120, P01174.3 |
Data resources: UniSave
USPTO Proteins (uspto_prt)
https://www.ebi.ac.uk/patentdata/proteins/
Protein sequences appearing in patents from the United States Patent and Trademark Office (USPTO).
| Format | Styles | Example Identifiers |
|---|---|---|
| default | default, html, raw |
Accession:
AAA00053
Name: I02590 Sequence version: AAA00053.1 |
| annot | default, html, raw |
Accession:
AAA00053
Name: I02590 Sequence version: AAA00053.1 |
| embl | default, html, raw |
Accession:
AAA00053
Name: I02590 Sequence version: AAA00053.1 |
| entrysize | default, html, raw |
Accession:
AAA00053
Name: I02590 Sequence version: AAA00053.1 |
| fasta | default, html, raw |
Accession:
AAA00053
Name: I02590 Sequence version: AAA00053.1 |
| seqxml | default, raw |
Accession:
AAA00053
Name: I02590 Sequence version: AAA00053.1 |
Data resources: SimpleIndex, EMBOSS entret, NCBI BLAST blastdbcmd, EMBOSS seqret, NCBI E-utilities