Files to Download from MEROPS

The files below are available for download by MEROPS users. Most of them are compressed and can be expanded with a utility such as WinZip.

1. MEROPS Release 12.4

The large file "meropsweb124.tar.gz" is a compressed archive of all the SQL statements required to build a MySQL instance of the current release of the MEROPS database. Some functions, such as the Searches and the BLAST server, are not supported in this format.

DOWNLOAD #1 
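
As a minimal sketch of the unpacking step, the following Python function expands a ".tar.gz" archive into a directory and returns the files it wrote. The function name "extract_archive" and the destination directory are illustrative assumptions; the names and layout of the files inside "meropsweb124.tar.gz" are not described here, so inspect the returned list before loading anything into MySQL.

```python
import shutil
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir):
    """Unpack the regular files of a .tar.gz archive into dest_dir.

    Returns the sorted list of extracted file paths.
    """
    dest = Path(dest_dir).resolve()
    dest.mkdir(parents=True, exist_ok=True)
    extracted = []
    with tarfile.open(archive_path, "r:gz") as tar:
        for member in tar.getmembers():
            # Only regular files are copied; links and devices are ignored.
            if not member.isfile():
                continue
            target = (dest / member.name).resolve()
            # Skip any entry that would be written outside dest_dir.
            if dest not in target.parents:
                continue
            target.parent.mkdir(parents=True, exist_ok=True)
            with tar.extractfile(member) as src, open(target, "wb") as out:
                shutil.copyfileobj(src, out)
            extracted.append(target)
    return sorted(extracted)


# Example call (the destination directory name is an assumption):
# files = extract_archive("meropsweb124.tar.gz", "merops_sql")
```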

2. Peptidase and Inhibitor Accession Numbers

The file "dnld_list.txt" contains a non-redundant list of accession numbers that enables the user to create a library of peptidase and peptidase inhibitor amino acid sequences. Accession numbers are given for every peptidase for which we have one, but sequences that have not been deposited are inevitably missing. To retrieve all the sequences, use the GCG command "fetch@filename" (where "filename" is the name of the text file). Alternatively, use the file of accession numbers as a reduced data set for searching with FastA: when prompted for the database name, enter "@filename" instead. Each line includes, after the exclamation mark, identifiers for the peptidase family and the species of organism, which allow the data to be filtered. The organism identifier is the one from the NCBI Taxonomy database, as shown on each organism card in MEROPS.

DOWNLOAD #2 
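
The filtering described above can be sketched in Python as follows. This is an illustration under stated assumptions, not a description of the exact file layout: it assumes that the accession number precedes the exclamation mark and that the identifiers after it are separated by whitespace. The function names and the sample lines are made up for the example.

```python
def parse_accession_line(line):
    """Split one line into (accession, [identifier tokens after '!'])."""
    accession, _, comment = line.partition("!")
    return accession.strip(), comment.split()


def filter_accessions(lines, wanted_token):
    """Return the accessions whose identifiers include wanted_token."""
    selected = []
    for line in lines:
        if not line.strip():
            continue  # ignore blank lines
        accession, tokens = parse_accession_line(line)
        if accession and wanted_token in tokens:
            selected.append(accession)
    return selected


# Made-up sample lines; real lines in dnld_list.txt may be laid out differently.
sample = [
    "ACC0001 ! S01 9606",
    "ACC0002 ! C14 10090",
    "ACC0003 ! S01 10090",
]
print(filter_accessions(sample, "S01"))  # prints ['ACC0001', 'ACC0003']
```

Writing the selected accessions back to a text file, one per line, gives a smaller list that can be used in the same way as the full file.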

3. Peptidase Protein Sequences

The file "pepunit.lib" is a non-redundant library of protein sequences in FastA format of the peptidase units and inhibitor units of all the peptidases and peptidase inhibitors that are included in MEROPS. The library can be searched with FastA without further modification, but must be converted and indexed for BLAST searches. Restricting the library to peptidase and inhibitor units should decrease the risk of false positive matches to other domains.

DOWNLOAD #3 
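
For users who want to inspect the library in a script, a FastA-format file can be read with a few lines of Python. The sketch below is generic: it makes no assumption about how the MEROPS header lines are laid out and simply returns each header with its sequence. The function name "read_fasta" and the sample records are made up for the example.

```python
def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FastA-format lines."""
    header, chunks = None, []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


# Made-up records for illustration only.
sample = [
    ">record1 example header",
    "MKTAYIAK",
    "QRQISFVK",
    ">record2 example header",
    "GSHMLEDP",
]
for header, sequence in read_fasta(sample):
    print(header, len(sequence))

# To read the downloaded file instead:
# with open("pepunit.lib") as handle:
#     records = list(read_fasta(handle))
```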

4. Peptidase Full-length Sequences

The file "protease.lib" is a non-redundant library of protein sequences in FastA format for all the peptidases and peptidase inhibitors included in MEROPS. The library can be searched by use of FastA without further modification, but must be converted and indexed for BLAST searches.

DOWNLOAD #4 
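
One way to carry out the conversion and indexing for BLAST is the "makeblastdb" program from the NCBI BLAST+ package. The Python sketch below builds and runs that command; it assumes BLAST+ is installed and on the PATH, and the database name "protease" is an arbitrary choice. Other BLAST distributions use different formatting tools, so treat this as one possible approach.

```python
import shutil
import subprocess


def makeblastdb_command(fasta_path, db_name):
    """Return the makeblastdb argument list for a protein FastA library."""
    return [
        "makeblastdb",
        "-in", str(fasta_path),   # input FastA file
        "-dbtype", "prot",        # protein sequences
        "-out", str(db_name),     # base name of the database files
    ]


def build_blast_db(fasta_path, db_name):
    """Run makeblastdb, raising an error if it is missing or fails."""
    if shutil.which("makeblastdb") is None:
        raise RuntimeError("makeblastdb was not found on PATH")
    subprocess.run(makeblastdb_command(fasta_path, db_name), check=True)


# Example call (requires NCBI BLAST+ to be installed):
# build_blast_db("protease.lib", "protease")
```

The same call applies to the other FastA libraries on this page by changing the file name.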

5. MEROPS Scan Sequences

The file "merops_scan.lib" is a subset of "pepunit.lib" containing the sequences used for the MEROPS batch BLAST. It is a non-redundant library of protein sequences in FastA format of the peptidase units for all the family type examples and peptidase/inhibitor holotypes. The library can be searched with FastA without further modification, but must be converted and indexed for BLAST searches.

DOWNLOAD #5 

6. Family Protein Sequences

A FastA format file of peptidase/inhibitor unit amino acid sequences can be downloaded for each family in MEROPS. Select your family from the directory listing at the following location: DOWNLOAD #6 

7. MEROPS References

The file "meropsrefs.txt" is a text version of all the SQL statements required to build a MySQL instance of the current release of the MEROPS reference database.

DOWNLOAD #7