Publications

2018

The Human RNA-Binding Proteome and Its Dynamics during Translational Arrest.
Trendel J, Schwarzl T, Horos R, Prakash A, Bateman A, Hentze MW, Krijgsveld J. Cell (2018) DOI: 10.1016/j.cell.2018.11.004
TADOSS: computational estimation of tandem domain swap stability.
Lafita A, Tian P, Best RB, Bateman A. Bioinformatics (Oxford, England) (2018) DOI: 10.1093/bioinformatics/bty974
RNAcentral: a hub of information for non-coding RNA sequences.
The RNAcentral Consortium. Nucleic acids research (2018) DOI: 10.1093/nar/gky1034
The Gene Ontology Resource: 20 years and still GOing strong.
The Gene Ontology Consortium. Nucleic acids research (2018) DOI: 10.1093/nar/gky1055
UniProt: a worldwide hub of protein knowledge.
UniProt Consortium. Nucleic acids research (2018) DOI: 10.1093/nar/gky1049
The Pfam protein families database in 2019.
El-Gebali S, Mistry J, Bateman A, Eddy SR, Luciani A, Potter SC, Qureshi M, Richardson LJ, Salazar GA, Smart A, Sonnhammer ELL, Hirsh L, Paladin L, Piovesan D, Tosatto SCE, Finn RD. Nucleic acids research (2018) DOI: 10.1093/nar/gky995
Non-Coding RNA Analysis Using the Rfam Database.
Kalvari I, Nawrocki EP, Argasinska J, Quinones-Olvera N, Finn RD, Bateman A, Petrov AI. Current protocols in bioinformatics Volume 62 (2018) p.e51 DOI: 10.1002/cpbi.51
The Human RNA-Binding Proteome and Its Dynamics During Arsenite-Induced Translational Arrest
Trendel J, Schwarzl T, Prakash A, Bateman A, Hentze MW, Krijgsveld J. Preprint DOI: 10.1101/329995
Analyzing the symmetrical arrangement of structural repeats in proteins with CE-Symm
Bliven SE, Lafita A, Rose PW, Capitani G, Prlic A, Bourne P. Preprint DOI: 10.1101/297960
Gene Unprediction with Spurio: A tool to identify spurious protein sequences.
Höps W, Jeffryes M, Bateman A. F1000Research Volume 7 (2018) p.261 DOI: 10.12688/f1000research.14050.1
The MEROPS database of proteolytic enzymes, their substrates and inhibitors in 2017 and a comparison with peptidases in the PANTHER database.
Rawlings ND, Barrett AJ, Thomas PD, Huang X, Bateman A, Finn RD. Nucleic acids research Volume 46 (2018) p.D624-D632 DOI: 10.1093/nar/gkx1134
Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families.
Kalvari I, Argasinska J, Quinones-Olvera N, Nawrocki EP, Rivas E, Eddy SR, Bateman A, Finn RD, Petrov AI. Nucleic acids research Volume 46 (2018) p.D335-D342 DOI: 10.1093/nar/gkx1038

2017

The HMMER Web Server for Protein Sequence Similarity Search.
Prakash A, Jeffryes M, Bateman A, Finn RD. Current protocols in bioinformatics Volume 60 (2017) p.3.15.1-3.15.23 DOI: 10.1002/cpbi.40
Automated evaluation of quaternary structures from protein crystals
Bliven S, Lafita A, Parker A, Capitani G, Duarte JM. Preprint DOI: 10.1101/224717
On expert curation and scalability: UniProtKB/Swiss-Prot as a case study.
Poux S, Arighi CN, Magrane M, Bateman A, Wei CH, Lu Z, Boutet E, Bye-A-Jee H, Famiglietti ML, Roechert B, UniProt Consortium. Bioinformatics (Oxford, England) Volume 33 (2017) p.3454-3460 DOI: 10.1093/bioinformatics/btx439
The yeast noncoding RNA interaction network.
Panni S, Prakash A, Bateman A, Orchard S. RNA (New York, N.Y.) Volume 23 (2017) p.1479-1492 DOI: 10.1261/rna.060996.117
Eros is a novel transmembrane protein that controls the phagocyte respiratory burst and is essential for innate immunity.
Thomas DC, Clare S, Sowerby JM, Pardo M, Juss JK, Goulding DA, van der Weyden L, Storisteanu D, Prakash A, Espéli M, Flint S, Lee JC, Hoenderdos K, Kane L, Harcourt K, Mukhopadhyay S, Umrania Y, Antrobus R, Nathan JA, Adams DJ, Bateman A, Choudhary JS, Lyons PA, Condliffe AM, Chilvers ER, Dougan G, Smith KG. The Journal of experimental medicine Volume 214 (2017) p.1111-1128 DOI: 10.1084/jem.20161382
Data management: A global coalition to sustain core data.
Anderson WP, Global Life Science Data Resources Working Group. Nature Volume 543 (2017) p.179 DOI: 10.1038/543179a
Structure of the Escherichia coli ProQ RNA-binding protein.
Gonzalez GM, Hardwick SW, Maslen SL, Skehel JM, Holmqvist E, Vogel J, Bateman A, Luisi BF, Broadhurst RW. RNA (New York, N.Y.) Volume 23 (2017) p.696-711 DOI: 10.1261/rna.060343.116

2016

On expert curation and sustainability: UniProtKB/Swiss-Prot as a case study
Poux S, Arighi CN, Magrane M, Bateman A, Wei C, Lu Z, Boutet E, Bye-A-Jee H, Famiglietti ML, Roechert B. Preprint DOI: 10.1101/094011
InterPro in 2017-beyond protein family and domain annotations.
Finn RD, Attwood TK, Babbitt PC, Bateman A, Bork P, Bridge AJ, Chang HY, Dosztányi Z, El-Gebali S, Fraser M, Gough J, Haft D, Holliday GL, Huang H, Huang X, Letunic I, Lopez R, Lu S, Marchler-Bauer A, Mi H, Mistry J, Natale DA, Necci M, Nuka G, Orengo CA, Park Y, Pesseat S, Piovesan D, Potter SC, Rawlings ND, Redaschi N, Richardson L, Rivoire C, Sangrador-Vegas A, Sigrist C, Sillitoe I, Smithers B, Squizzato S, Sutton G, Thanki N, Thomas PD, Tosatto SC, Wu CH, Xenarios I, Yeh LS, Young SY, Mitchell AL. Nucleic acids research Volume 45 (2017) p.D190-D199 DOI: 10.1093/nar/gkw1107
RNAcentral: a comprehensive database of non-coding RNA sequences.
The RNAcentral Consortium, Petrov AI, Kay SJE, Kalvari I, Howe KL, Gray KA, Bruford EA, Kersey PJ, Cochrane G, Finn RD, Bateman A, Kozomara A, Griffiths-Jones S, Frankish A, Zwieb CW, Lau BY, Williams KP, Chan PP, Lowe TM, Cannone JJ, Gutell R, Machnicka MA, Bujnicki JM, Yoshihama M, Kenmochi N, Chai B, Cole JR, Szymanski M, Karlowski WM, Wood V, Huala E, Berardini TZ, Zhao Y, Chen R, Zhu W, Paraskevopoulou MD, Vlachos IS, Hatzigeorgiou AG, Ma L, Zhang Z, Puetz J, Stadler PF, McDonald D, Basu S, Fey P, Engel SR, Cherry JM, Volders PJ, Mestdagh P, Wower J, Clark MB, Quek XC, Dinger ME. Nucleic acids research Volume 45 (2017) p.D128-D134 DOI: 10.1093/nar/gkw1008
UniProt-DAAC: domain architecture alignment and classification, a new method for automatic functional annotation in UniProtKB.
Doğan T, MacDougall A, Saidi R, Poggioli D, Bateman A, O'Donovan C, Martin MJ. Bioinformatics (Oxford, England) Volume 32 (2016) p.2264-2271 DOI: 10.1093/bioinformatics/btw114
Patterns of database citation in articles and patents indicate long-term scientific and industry value of biological data resources.
Bousfield D, McEntyre J, Velankar S, Papadatos G, Bateman A, Cochrane G, Kim JH, Graef F, Vartak V, Alako B, Blomberg N. F1000Research Volume 5 (2016) DOI: 10.12688/f1000research.7911.1

2015

The Pfam protein families database: towards a more sustainable future.
Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, Potter SC, Punta M, Qureshi M, Sangrador-Vegas A, Salazar GA, Tate J, Bateman A. Nucleic acids research Volume 44 (2016) p.D279-85 DOI: 10.1093/nar/gkv1344
The Importance of Biological Databases in Biological Discovery.
Baxevanis AD, Bateman A. Current protocols in bioinformatics Volume 50 (2015) p.1.1.1-8 DOI: 10.1002/0471250953.bi0101s50
HMMER web server: 2015 update.
Finn RD, Clements J, Arndt W, Miller BL, Wheeler TJ, Schreiber F, Bateman A, Eddy SR. Nucleic acids research Volume 43 (2015) p.W30-8 DOI: 10.1093/nar/gkv397
Domain atrophy creates rare cases of functional partial protein domains.
Prakash A, Bateman A. Genome biology Volume 16 (2015) p.88 DOI: 10.1186/s13059-015-0655-8
Key challenges for the creation and maintenance of specialist protein resources.
Holliday GL, Bairoch A, Bagos PG, Chatonnet A, Craik DJ, Finn RD, Henrissat B, Landsman D, Manning G, Nagano N, O'Donovan C, Pruitt KD, Rawlings ND, Saier M, Sowdhamini R, Spedding M, Srinivasan N, Vriend G, Babbitt PC, Bateman A. Proteins Volume 83 (2015) p.1005-1013 DOI: 10.1002/prot.24803

2014

Using the MEROPS Database for Proteolytic Enzymes and Their Inhibitors and Substrates.
Rawlings ND, Barrett AJ, Bateman A. Current protocols in bioinformatics Volume 48 (2014) p.1.25.1-33 DOI: 10.1002/0471250953.bi0125s48
Gene Ontology Consortium: going forward.
Gene Ontology Consortium. Nucleic acids research Volume 43 (2015) p.D1049-56 DOI: 10.1093/nar/gku1179
The InterPro protein families database: the classification resource after 15 years.
Mitchell A, Chang HY, Daugherty L, Fraser M, Hunter S, Lopez R, McAnulla C, McMenamin C, Nuka G, Pesseat S, Sangrador-Vegas A, Scheremetjew M, Rato C, Yong SY, Bateman A, Punta M, Attwood TK, Sigrist CJ, Redaschi N, Rivoire C, Xenarios I, Kahn D, Guyot D, Bork P, Letunic I, Gough J, Oates M, Haft D, Huang H, Natale DA, Wu CH, Orengo C, Sillitoe I, Mi H, Thomas PD, Finn RD. Nucleic acids research Volume 43 (2015) p.D213-21 DOI: 10.1093/nar/gku1243
Rfam 12.0: updates to the RNA families database.
Nawrocki EP, Burge SW, Bateman A, Daub J, Eberhardt RY, Eddy SR, Floden EW, Gardner PP, Jones TA, Tate J, Finn RD. Nucleic acids research Volume 43 (2015) p.D130-7 DOI: 10.1093/nar/gku1063
RNAcentral: an international database of ncRNA sequences.
RNAcentral Consortium, Petrov AI, Kay SJE, Gibson R, Kulesha E, Staines D, Bruford EA, Wright MW, Burge S, Finn RD, Kersey PJ, Cochrane G, Bateman A, Griffiths-Jones S, Harrow J, Chan PP, Lowe TM, Zwieb CW, Wower J, Williams KP, Hudson CM, Gutell R, Clark MB, Dinger M, Quek XC, Bujnicki JM, Chua NH, Liu J, Wang H, Skogerbø G, Zhao Y, Chen R, Zhu W, Cole JR, Chai B, Huang HD, Huang HY, Cherry JM, Hatzigeorgiou A, Pruitt KD. Nucleic acids research Volume 43 (2015) p.D123-9 DOI: 10.1093/nar/gku991
UniProt: a hub for protein information.
UniProt Consortium. Nucleic acids research Volume 43 (2015) p.D204-12 DOI: 10.1093/nar/gku989
Structure and computational analysis of a novel protein with metallopeptidase-like and circularly permuted winged-helix-turn-helix domains reveals a possible role in modified polysaccharide biosynthesis.
Das D, Murzin AG, Rawlings ND, Finn RD, Coggill P, Bateman A, Godzik A, Aravind L. BMC bioinformatics Volume 15 (2014) p.75 DOI: 10.1186/1471-2105-15-75
Expert curation in UniProtKB: a case study on dealing with conflicting and erroneous data.
Poux S, Magrane M, Arighi CN, Bridge A, O'Donovan C, Laiho K, UniProt Consortium. Database : the journal of biological databases and curation Volume 2014 (2014) p.bau016 DOI: 10.1093/database/bau016

2013

DATABASE, The Journal of Biological Databases and Curation, is now the official journal of the International Society for Biocuration.
Gaudet P, Munoz-Torres M, Robinson-Rechavi M, Attwood T, Bateman A, Cherry JM, Kania R, O'Donovan C, Yamasaki C. Database : the journal of biological databases and curation Volume 2013 (2013) p.bat077 DOI: 10.1093/database/bat077
iPfam: a database of protein family and domain interactions found in the Protein Data Bank.
Finn RD, Miller BL, Clements J, Bateman A. Nucleic acids research Volume 42 (2014) p.D364-73 DOI: 10.1093/nar/gkt1210
Pfam: the protein families database.
Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, Heger A, Hetherington K, Holm L, Mistry J, Sonnhammer EL, Tate J, Punta M. Nucleic acids research Volume 42 (2014) p.D222-30 DOI: 10.1093/nar/gkt1223
LUD, a new protein domain associated with lactate utilization.
Hwang WC, Bakolitsa C, Punta M, Coggill PC, Bateman A, Axelrod HL, Rawlings ND, Sedova M, Peterson SN, Eberhardt RY, Aravind L, Pascual J, Godzik A. BMC bioinformatics Volume 14 (2013) p.341 DOI: 10.1186/1471-2105-14-341
Filling out the structural map of the NTF2-like superfamily.
Eberhardt RY, Chang Y, Bateman A, Murzin AG, Axelrod HL, Hwang WC, Aravind L. BMC bioinformatics Volume 14 (2013) p.327 DOI: 10.1186/1471-2105-14-327
Activities at the Universal Protein Resource (UniProt).
UniProt Consortium. Nucleic acids research Volume 42 (2014) p.D191-8 DOI: 10.1093/nar/gkt1140
TreeFam v9: a new website, more species and orthology-on-the-fly.
Schreiber F, Patricio M, Muffato M, Pignatelli M, Bateman A. Nucleic acids research Volume 42 (2014) p.D922-5 DOI: 10.1093/nar/gkt1055
MEROPS: the database of proteolytic enzymes, their substrates and inhibitors.
Rawlings ND, Waller M, Barrett AJ, Bateman A. Nucleic acids research Volume 42 (2014) p.D503-9 DOI: 10.1093/nar/gkt953
ISCB computational biology Wikipedia competition.
Bateman A, Kelso J, Mietchen D, Macintyre G, Di Domenico T, Abeel T, Logan DW, Radivojac P, Rost B. PLoS computational biology Volume 9 (2013) p.e1003242 DOI: 10.1371/journal.pcbi.1003242
Two Pfam protein families characterized by a crystal structure of protein lpg2210 from Legionella pneumophila.
Coggill P, Eberhardt RY, Finn RD, Chang Y, Jaroszewski L, Godzik A, Das D, Xu Q, Axelrod HL, Aravind L, Murzin AG, Bateman A. BMC bioinformatics Volume 14 (2013) p.265 DOI: 10.1186/1471-2105-14-265
The COMBREX project: design, methodology, and initial results.
Anton BP, Chang YC, Brown P, Choi HP, Faller LL, Guleria J, Hu Z, Klitgord N, Levy-Moonshine A, Maksad A, Mazumdar V, McGettrick M, Osmani L, Pokrzywa R, Rachlin J, Swaminathan R, Allen B, Housman G, Monahan C, Rochussen K, Tao K, Bhagwat AS, Brenner SE, Columbus L, de Crécy-Lagard V, Ferguson D, Fomenkov A, Gadda G, Morgan RD, Osterman AL, Rodionov DA, Rodionova IA, Rudd KE, Söll D, Spain J, Xu SY, Bateman A, Blumenthal RM, Bollinger JM, Chang WS, Ferrer M, Friedberg I, Galperin MY, Gobeill J, Haft D, Hunt J, Karp P, Klimke W, Krebs C, Macelis D, Madupu R, Martin MJ, Miller JH, O'Donovan C, Palsson B, Ruch P, Setterdahl A, Sutton G, Tate J, Yakunin A, Tchigvintsev D, Plata G, Hu J, Greiner R, Horn D, Sjölander K, Salzberg SL, Vitkup D, Letovsky S, Segrè D, DeLisi C, Roberts RJ, Steffen M, Kasif S. PLoS biology Volume 11 (2013) p.e1001638 DOI: 10.1371/journal.pbio.1001638
The challenge of increasing Pfam coverage of the human proteome.
Mistry J, Coggill P, Eberhardt RY, Deiana A, Giansanti A, Finn RD, Bateman A, Punta M. Database : the journal of biological databases and curation Volume 2013 (2013) DOI: 10.1093/database/bat040
Alternative splicing of intrinsically disordered regions and rewiring of protein interactions.
Buljan M, Chalancon G, Dunker AK, Bateman A, Balaji S, Fuxreiter M, Babu MM. Current opinion in structural biology Volume 23 (2013) p.443-450 DOI: 10.1016/j.sbi.2013.03.006
The challenge of increasing Pfam coverage of the human proteome.
Mistry J, Coggill P, Eberhardt RY, Deiana A, Giansanti A, Finn RD, Bateman A, Punta M. Database : the journal of biological databases and curation Volume 2013 (2013) p.bat023 DOI: 10.1093/database/bat023
Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions.
Mistry J, Finn RD, Eddy SR, Bateman A, Punta M. Nucleic acids research Volume 41 (2013) p.e121 DOI: 10.1093/nar/gkt263
A comparison of dense transposon insertion libraries in the Salmonella serovars Typhi and Typhimurium.
Barquist L, Langridge GC, Turner DJ, Phan MD, Turner AK, Bateman A, Parkhill J, Wain J, Gardner PP. Nucleic acids research Volume 41 (2013) p.4549-4564 DOI: 10.1093/nar/gkt148
The SHOCT domain: a widespread domain under-represented in model organisms.
Eberhardt RY, Bartholdson SJ, Punta M, Bateman A. PloS one Volume 8 (2013) p.e57848 DOI: 10.1371/journal.pone.0057848
Genome of Acanthamoeba castellanii highlights extensive lateral gene transfer and early evolution of tyrosine kinase signaling.
Clarke M, Lohan AJ, Liu B, Lagkouvardos I, Roy S, Zafar N, Bertelli C, Schilde C, Kianianmomeni A, Bürglin TR, Frech C, Turcotte B, Kopec KO, Synnott JM, Choo C, Paponov I, Finkler A, Heng Tan CS, Hutchins AP, Weinmeier T, Rattei T, Chu JS, Gimenez G, Irimia M, Rigden DJ, Fitzpatrick DA, Lorenzo-Morales J, Bateman A, Chiu CH, Tang P, Hegemann P, Fromm H, Raoult D, Greub G, Miranda-Saavedra D, Chen N, Nash P, Ginger ML, Horn M, Schaap P, Caler L, Loftus BJ. Genome biology Volume 14 (2013) p.R11 DOI: 10.1186/gb-2013-14-2-r11

2012

Rfam 11.0: 10 years of RNA families.
Burge SW, Daub J, Eberhardt R, Tate J, Barquist L, Nawrocki EP, Eddy SR, Gardner PP, Bateman A. Nucleic acids research Volume 41 (2013) p.D226-32 DOI: 10.1093/nar/gks1005
Recent advances in biocuration: meeting report from the Fifth International Biocuration Conference.
Gaudet P, Arighi C, Bastian F, Bateman A, Blake JA, Cherry MJ, D'Eustachio P, Finn R, Giglio M, Hirschman L, Kania R, Klimke W, Martin MJ, Karsch-Mizrachi I, Munoz-Torres M, Natale D, O'Donovan C, Ouellette F, Pruitt KD, Robinson-Rechavi M, Sansone SA, Schofield P, Sutton G, Van Auken K, Vasudevan S, Wu C, Young J, Mazumder R. Database : the journal of biological databases and curation Volume 2012 (2012) p.bas036 DOI: 10.1093/database/bas036
Tissue-specific splicing of disordered segments that embed binding motifs rewires protein interaction networks.
Buljan M, Chalancon G, Eustermann S, Wagner GP, Fuxreiter M, Bateman A, Babu MM. Molecular cell Volume 46 (2012) p.871-883 DOI: 10.1016/j.molcel.2012.05.039
InterPro in 2011: new developments in the family and domain prediction database.
Hunter S, Jones P, Mitchell A, Apweiler R, Attwood TK, Bateman A, Bernard T, Binns D, Bork P, Burge S, de Castro E, Coggill P, Corbett M, Das U, Daugherty L, Duquenne L, Finn RD, Fraser M, Gough J, Haft D, Hulo N, Kahn D, Kelly E, Letunic I, Lonsdale D, Lopez R, Madera M, Maslen J, McAnulla C, McDowall J, McMenamin C, Mi H, Mutowo-Muellenet P, Mulder N, Natale D, Orengo C, Pesseat S, Punta M, Quinn AF, Rivoire C, Sangrador-Vegas A, Selengut JD, Sigrist CJA, Scheremetjew M, Tate J, Thimmajanarthanan M, Thomas PD, Wu CH, Yeats C, Yong S. Nucleic acids research Volume 40 (2012) p.4725-4725 DOI: 10.1093/nar/gks456
The YARHG domain: an extracellular domain in search of a function.
Coggill P, Bateman A. PloS one Volume 7 (2012) p.e35575 DOI: 10.1371/journal.pone.0035575
Biocurators and biocuration: surveying the 21st century challenges.
Burge S, Attwood TK, Bateman A, Berardini TZ, Cherry M, O'Donovan C, Xenarios L, Gaudet P. Database : the journal of biological databases and curation Volume 2012 (2012) p.bar059 DOI: 10.1093/database/bar059
AntiFam: a tool to help identify spurious ORFs in protein annotation.
Eberhardt RY, Haft DH, Punta M, Martin M, O'Donovan C, Bateman A. Database : the journal of biological databases and curation Volume 2012 (2012) p.bas003 DOI: 10.1093/database/bas003
Bioimage informatics: a new category in Bioinformatics.
Peng H, Bateman A, Valencia A, Wren JD. Bioinformatics (Oxford, England) Volume 28 (2012) p.1057 DOI: 10.1093/bioinformatics/bts111

2011

Making your database available through Wikipedia: the pros and cons.
Finn RD, Gardner PP, Bateman A. Nucleic acids research Volume 40 (2012) p.D9-12 DOI: 10.1093/nar/gkr1195
The Pfam protein families database.
Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J, Heger A, Holm L, Sonnhammer EL, Eddy SR, Bateman A, Finn RD. Nucleic acids research Volume 40 (2012) p.D290-301 DOI: 10.1093/nar/gkr1065
InterPro in 2011: new developments in the family and domain prediction database.
Hunter S, Jones P, Mitchell A, Apweiler R, Attwood TK, Bateman A, Bernard T, Binns D, Bork P, Burge S, de Castro E, Coggill P, Corbett M, Das U, Daugherty L, Duquenne L, Finn RD, Fraser M, Gough J, Haft D, Hulo N, Kahn D, Kelly E, Letunic I, Lonsdale D, Lopez R, Madera M, Maslen J, McAnulla C, McDowall J, McMenamin C, Mi H, Mutowo-Muellenet P, Mulder N, Natale D, Orengo C, Pesseat S, Punta M, Quinn AF, Rivoire C, Sangrador-Vegas A, Selengut JD, Sigrist CJ, Scheremetjew M, Tate J, Thimmajanarthanan M, Thomas PD, Wu CH, Yeats C, Yong SY. Nucleic acids research Volume 40 (2012) p.D306-12 DOI: 10.1093/nar/gkr948
MEROPS: the database of proteolytic enzymes, their substrates and inhibitors.
Rawlings ND, Barrett AJ, Bateman A. Nucleic acids research Volume 40 (2012) p.D343-50 DOI: 10.1093/nar/gkr987