Release Notes

Release 6.1, Thur Apr 24 14:00 2003

Acknowledgements

R.Apweiler (1), T.K.Attwood (4), A.Bairoch (2), A.Bateman (5), D.Binns (1), P.Bradley (1,4), P.Bork (8), P.Bucher (3), L. Cerutti (3), R.Copley (8), E.Courcelle (6), U.Das (1) R.Durbin (5), W.Fleischmann (1), J.Gouzy (6), S.Griffiths-Jones (5) D.Haft (9), N.Hulo (2), D.Kahn (6), A.Kanapin (1), M.Krestyaninova (1), D.Lonsdale (1), R.Lopez (1), I.Letunic (8),Anastasia N. Nikolskaya (10), N.Mulder (1), S. Orchard (1), M.Pagni (3), D.Peyruc (6), C.Ponting (7), E.Quevillon (1), F.Servant (1), C.Sigrist (2), D.J.Studholme (5), Cathy H. Wu (10) .

(1) EMBL Outstation European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK;
(2) Swiss Institute for Bioinformatics, Geneva, Switzerland;
(3) Swiss Institute for Experimental Cancer Research, Lausanne, Switzerland;
(4) School of Biological Sciences, The University of Manchester, Manchester, UK;
(5) The Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK;
(6) CNRS/INRA, Toulouse, France;
(7) MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, UK;
(8) Biocomputing Unit, EMBL-Heidelberg, Germany;
(9) The Institute for Genomic Research, Maryland, USA;
(10) Protein Information Resource, Georgetown University Medical Center, Washington, D.C., USA.

Introduction

The databases Swiss-Prot, TrEMBL, PROSITE, PRINTS, Pfam, and ProDom joined forces to launch an Integrated Documentation Resource of Protein Families, Domains and Functional Sites, abbreviated InterPro. SMART, TIGRFAMs and most recently PIR SuperFamilies have subsequently joined InterPro. A detailed description of the project can be found in the user manual.

Changes since last major release

The first batch of PIR SuperFamily HMMs have been integrated.

A new-look web site has been launched for the InterPro and QuickGO browsers. The pages have changed to reflect the new style of the EBI pages in general, but in addition, have increased capabilities and new features. The ordering of fields in the InterPro entries has changed to provide links to the protein matches at the top of the entry. An additional graphical match view is available in an overview format, which is also used in the entry to display the example lists. It is now possible to sort the protein matches by Swiss-Prot ID. The simple text search has been extended to include different search options and results outputs, and all pages may be viewed in the user's format of choice out of Normal, Printer friendly, Text, simple HTML and XML.

The protein matches have been updated according to the latest updates of Swiss-Prot and TrEMBL. InterProScan has also been updated to include new member database signatures. The number of mappings to the Gene Ontology (GO) classification system has increased since the last release, and where applicable, InterPro entries now have links to the Transport Classification database.

We have also introduced FAQs pages to provide further help to InterPro users, and an online tutorial.

Contents of current release

InterPro release 6.1 contains 7785 entries, representing 1744 domains, 5877 families, 147 repeats, and 17 sites. Overall, there are 3406983 InterPro hits from 729397 SWISS-PROT + TrEMBL protein sequences. A complete list is available from the ftp site.

The current version of InterPro consists of:

DATABASE VERSION ENTRIES
SWISS-PROT 41.3 122564
PRINTS 35.0 1750
TrEMBL 23.0 830525
Pfam 8.0 5193
PROSITE patterns 17.37 1605
PROSITE preprofiles N/A 150
ProDom 2002.1 1021
InterPro 6.1 7785
Smart 3.4 654
TIGRFAMs 2.1 1614
PIR SuperFamily 1.0 50

Forthcoming changes

The next release of InterPro will be a point release, 6.2, scheduled for July 2003. A project is underway to incorporate curated protein secondary and tertiary structure information and we hope to provide such links from InterPro entries where applicable in the near future.