Download
InterProScan
| Name | Description | Link |
|---|---|---|
| InterProScan 5 | Complete documentation on downloading, installing and using the latest version of InterProScan 5 (*recommended version*). | HTML |
| InterProScan 4.8 | FTP site for downloading InterProScan 4.8 (*no longer supported or updated*). | HTML |
InterPro
You can download InterPro's content in a number of ways:
XML and other flat files
| Name | Description | File name | Format | Link |
|---|---|---|---|---|
| Entry list | XML file listing each entry, the signatures it contains, its abstract, GO terms, etc. It contains the equivalent of the Entry pages on the web interface. A DTD file describing the format is available. | interpro.xml.gz | gzipped | |
| Protein matched complete | All UniProtKB proteins and the InterPro entries and individual signatures they match, in XML format. Proteins without any matches to InterPro are also included. This file is used by the InterProScan 4 software to look up results. A DTD file describing the format is available. | match_complete.xml.gz | gzipped | |
| UniMES sequences | All UniMES (metagenomics) sequences and the InterPro entries and individual signatures they match, in XML format. | unimes_match.tar.gz | gzipped tar | |
| UniParc sequences | All UniParc (UniProt Archive) sequences and the InterPro entries and individual signatures they match, in XML format. | uniparc_match.tar.gz | gzipped tar | |
| UniProtKB proteins | All UniProtKB proteins and the InterPro entries and individual signatures they match, in a tab-delimited format. | protein2ipr.dat.gz | gzipped | |
| Entry relationships tree | File describing the hierarchy of relationships between InterPro's entries (i.e. families and their subfamilies) in a simple text-based format. | ParentChildTreeFile.txt | TXT | |
| List of GO terms | Mappings of InterPro entries to Gene Ontology (GO) terms. | interpro2go | TXT | |
| Latest release note | The current release notes, in text-based format. | release_notes.txt | TXT | |
See all downloads available on the FTP site.
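The tab-delimited `protein2ipr.dat.gz` file is large, so it is usually easier to stream it than to load it whole. The following is a minimal sketch in Python, not an official client. It assumes each line holds six tab-separated columns (protein accession, InterPro entry accession, entry name, signature accession, start, end) and that the file has no header row; check this layout against the file you download. The names `Match`, `parse_matches`, `entries_by_protein` and `read_gzipped` are hypothetical helpers introduced for illustration.

```python
import gzip
from collections import defaultdict
from typing import Dict, Iterable, Iterator, NamedTuple, Set


class Match(NamedTuple):
    """One signature match; the column layout is an assumption."""
    protein: str
    entry: str
    entry_name: str
    signature: str
    start: int
    end: int


def parse_matches(lines: Iterable[str]) -> Iterator[Match]:
    """Yield one Match per well-formed line, skipping blank or short lines."""
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 6:
            continue  # not the assumed six-column layout
        protein, entry, name, signature, start, end = fields
        yield Match(protein, entry, name, signature, int(start), int(end))


def entries_by_protein(matches: Iterable[Match]) -> Dict[str, Set[str]]:
    """Group InterPro entry accessions by protein accession."""
    grouped: Dict[str, Set[str]] = defaultdict(set)
    for match in matches:
        grouped[match.protein].add(match.entry)
    return dict(grouped)


def read_gzipped(path: str) -> Iterator[Match]:
    """Stream matches from a gzipped file without decompressing it to disk."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        yield from parse_matches(handle)
```

With a local copy of the file, `entries_by_protein(read_gzipped("protein2ipr.dat.gz"))` would build the full mapping in memory. For the complete file it is probably better to filter inside the loop over `read_gzipped` and keep only the proteins of interest.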
From individual web pages
- A FASTA file containing all of the sequences matching an InterPro entry is available for download from the Entry Proteins matched subpage
- You can also download the sequences matching an entry for a specific taxon or species from the Entry Species subpage
- TSV (tab-separated values) files listing the matches of signatures and entries to proteins are available on both the Entry Proteins matched subpage and on individual Protein pages
- You can export the references used in an Entry, in Medline format, from the Entry References page
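Once a FASTA file has been saved from one of these pages, it can be read with a few lines of Python. This is a minimal sketch that assumes standard FASTA conventions (a header line starting with `>`, followed by one or more sequence lines); it makes no assumption about how InterPro formats the header text itself. The function name `read_fasta` is a hypothetical helper, not part of any InterPro tool.

```python
from typing import Iterable, Iterator, List, Optional, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from the lines of a FASTA file."""
    header: Optional[str] = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            # A new record starts; emit the previous one first.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            # Sequence lines are concatenated until the next header.
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)
```

For example, `with open("entry_proteins.fasta") as handle: records = list(read_fasta(handle))` would collect every record from a downloaded file; the file name here is only a placeholder.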
Using BioMart
You can use BioMart to construct more complex queries. For example, BioMart enables you to query and download InterPro abstracts, protein and signature matches, taxonomy data and GO annotations in a range of formats.


