Publications

Publications

2017

Structure of the Escherichia coli ProQ RNA-binding protein.
Gonzalez GM, Hardwick SW, Maslen SL, Skehel JM, Holmqvist E, Vogel J, Bateman A, Luisi BF, Broadhurst RW.
RNA (New York, N.Y.) Volume 23 (2017) p.696-711

RNAcentral: a comprehensive database of non-coding RNA sequences.
The RNAcentral Consortium.
Nucleic Acids Research Volume 45 (2017) p.D128-D134

The MEROPS database of proteolytic enzymes, their substrates and inhibitors in 2017 and a comparison with peptidases in the PANTHER database.
Rawlings ND, Barrett AJ, Thomas PD, Huang X, Bateman A, Finn RD.
Nucleic Acids Research Volume (2017) p.

Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families.
Kalvari I, Argasinska J, Quinones-Olvera N, Nawrocki EP, Rivas E, Eddy SR, Bateman A, Finn RD, Petrov AI.
Nucleic Acids Research Volume (2017) p.

Biological and functional relevance of CASP predictions.
Liu T, Ish-Shalom S, Torng W, Lafita A, Bock C, Mort M, Cooper DN, Bliven S, Capitani G, Mooney SD, Altman RB.
Proteins Volume (2017) p.

The yeast noncoding RNA interaction network.
Panni S, Prakash A, Bateman A, Orchard S.
RNA (New York, N.Y.) Volume 23 (2017) p.1479-1492

InterPro in 2017-beyond protein family and domain annotations.
Finn RD, Attwood TK, Babbitt PC, Bateman A, Bork P, Bridge AJ, Chang HY, Dosztányi Z, El-Gebali S, Fraser M, Gough J, Haft D, Holliday GL, Huang H, Huang X, Letunic I, Lopez R, Lu S, Marchler-Bauer A, Mi H, Mistry J, Natale DA, Necci M, Nuka G, Orengo CA, Park Y, Pesseat S, Piovesan D, Potter SC, Rawlings ND, Redaschi N, Richardson L, Rivoire C, Sangrador-Vegas A, Sigrist C, Sillitoe I, Smithers B, Squizzato S, Sutton G, Thanki N, Thomas PD, Tosatto SC, Wu CH, Xenarios I, Yeh LS, Young SY, Mitchell AL.
Nucleic Acids Research Volume 45 (2017) p.D190-D199

Assessment of protein assembly prediction in CASP12.
Lafita A, Bliven S, Kryshtafovych A, Bertoni M, Monastyrskyy B, Duarte JM, Schwede T, Capitani G.
Proteins Volume (2017) p.

2016

Patterns of database citation in articles and patents indicate long-term scientific and industry value of biological data resources.
Bousfield D, McEntyre J, Velankar S, Papadatos G, Bateman A, Cochrane G, Kim JH, Graef F, Vartak V, Alako B, Blomberg N.
F1000Research Volume 5 (2016) p.

UniProt-DAAC: domain architecture alignment and classification, a new method for automatic functional annotation in UniProtKB.
Doğan T, MacDougall A, Saidi R, Poggioli D, Bateman A, O'Donovan C, Martin MJ.
Bioinformatics (Oxford, England) Volume 32 (2016) p.2264-2271

The Pfam protein families database: towards a more sustainable future.
Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, Potter SC, Punta M, Qureshi M, Sangrador-Vegas A, Salazar GA, Tate J, Bateman A.
Nucleic Acids Research Volume 44 (2016) p.D279-85

2015

UniProt: a hub for protein information.
UniProt Consortium.
Nucleic acids research Volume 43 (2015) p.D204-12

Domain atrophy creates rare cases of functional partial protein domains.
Prakash A, Bateman A.
Genome biology Volume 16 (2015) p.88

Rfam 12.0: updates to the RNA families database.
Nawrocki EP, Burge SW, Bateman A, Daub J, Eberhardt RY, Eddy SR, Floden EW, Gardner PP, Jones TA, Tate J, Finn RD.
Nucleic acids research Volume 43 (2015) p.D130-7

HMMER web server: 2015 update.
Finn RD, Clements J, Arndt W, Miller BL, Wheeler TJ, Schreiber F, Bateman A, Eddy SR.
Nucleic Acids Research Volume 43 (2015) p.W30-8

RNAcentral: an international database of ncRNA sequences.
RNAcentral Consortium.
Nucleic Acids Research Volume 43 (2015) p.D123-9

The Importance of Biological Databases in Biological Discovery.
Baxevanis AD, Bateman A.
Current protocols in bioinformatics Volume 50 (2015) p.1.1.1-8

Gene Ontology Consortium: going forward.
Gene Ontology Consortium.
Nucleic acids research Volume 43 (2015) p.D1049-56

Key challenges for the creation and maintenance of specialist protein resources.
Holliday GL, Bairoch A, Bagos PG, Chatonnet A, Craik DJ, Finn RD, Henrissat B, Landsman D, Manning G, Nagano N, O'Donovan C, Pruitt KD, Rawlings ND, Saier M, Sowdhamini R, Spedding M, Srinivasan N, Vriend G, Babbitt PC, Bateman A.
Proteins Volume 83 (2015) p.1005-1013

The InterPro protein families database: the classification resource after 15 years.
Mitchell A, Chang HY, Daugherty L, Fraser M, Hunter S, Lopez R, McAnulla C, McMenamin C, Nuka G, Pesseat S, Sangrador-Vegas A, Scheremetjew M, Rato C, Yong SY, Bateman A, Punta M, Attwood TK, Sigrist CJ, Redaschi N, Rivoire C, Xenarios I, Kahn D, Guyot D, Bork P, Letunic I, Gough J, Oates M, Haft D, Huang H, Natale DA, Wu CH, Orengo C, Sillitoe I, Mi H, Thomas PD, Finn RD.
Nucleic acids research Volume 43 (2015) p.D213-21

2014

Pfam: the protein families database.
Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, Heger A, Hetherington K, Holm L, Mistry J, Sonnhammer EL, Tate J, Punta M.
Nucleic acids research Volume 42 (2014) p.D222-30

MEROPS: the database of proteolytic enzymes, their substrates and inhibitors.
Rawlings ND, Waller M, Barrett AJ, Bateman A.
Nucleic acids research Volume 42 (2014) p.D503-9

iPfam: a database of protein family and domain interactions found in the Protein Data Bank.
Finn RD, Miller BL, Clements J, Bateman A.
Nucleic Acids Research Volume 42 (2014) p.D364-73

Activities at the Universal Protein Resource (UniProt).
UniProt Consortium.
Nucleic acids research Volume 42 (2014) p.D191-8

Structure and computational analysis of a novel protein with metallopeptidase-like and circularly permuted winged-helix-turn-helix domains reveals a possible role in modified polysaccharide biosynthesis.
Das D, Murzin AG, Rawlings ND, Finn RD, Coggill P, Bateman A, Godzik A, Aravind L.
BMC bioinformatics Volume 15 (2014) p.75

TreeFam v9: a new website, more species and orthology-on-the-fly.
Schreiber F, Patricio M, Muffato M, Pignatelli M, Bateman A.
Nucleic Acids Research Volume 42 (2014) p.D922-5

Using the MEROPS Database for Proteolytic Enzymes and Their Inhibitors and Substrates.
Rawlings ND, Barrett AJ, Bateman A.
Current protocols in bioinformatics Volume 48 (2014) p.1.25.1-33

Expert curation in UniProtKB: a case study on dealing with conflicting and erroneous data.
Poux S, Magrane M, Arighi CN, Bridge A, O'Donovan C, Laiho K, UniProt Consortium.
Database : the journal of biological databases and curation Volume 2014 (2014) p.bau016

2013

Filling out the structural map of the NTF2-like superfamily.
Eberhardt RY, Chang Y, Bateman A, Murzin AG, Axelrod HL, Hwang WC, Aravind L.
BMC Bioinformatics Volume 14 (2013) p.327

Rfam 11.0: 10 years of RNA families.
Burge SW, Daub J, Eberhardt R, Tate J, Barquist L, Nawrocki EP, Eddy SR, Gardner PP, Bateman A.
Nucleic acids research Volume 41 (2013) p.D226-32

LUD, a new protein domain associated with lactate utilization.
Hwang WC, Bakolitsa C, Punta M, Coggill PC, Bateman A, Axelrod HL, Rawlings ND, Sedova M, Peterson SN, Eberhardt RY, Aravind L, Pascual J, Godzik A.
BMC Bioinformatics Volume 14 (2013) p.341

Two Pfam protein families characterized by a crystal structure of protein lpg2210 from Legionella pneumophila.
Coggill P, Eberhardt RY, Finn RD, Chang Y, Jaroszewski L, Godzik A, Das D, Xu Q, Axelrod HL, Aravind L, Murzin AG, Bateman A.
BMC Bioinformatics Volume 14 (2013) p.265

The SHOCT domain: a widespread domain under-represented in model organisms.
Eberhardt RY, Bartholdson SJ, Punta M, Bateman A.
PloS one Volume 8 (2013) p.e57848

The COMBREX project: design, methodology, and initial results.
Anton BP, Chang YC, Brown P, Choi HP, Faller LL, Guleria J, Hu Z, Klitgord N, Levy-Moonshine A, Maksad A, Mazumdar V, McGettrick M, Osmani L, Pokrzywa R, Rachlin J, Swaminathan R, Allen B, Housman G, Monahan C, Rochussen K, Tao K, Bhagwat AS, Brenner SE, Columbus L, de Crécy-Lagard V, Ferguson D, Fomenkov A, Gadda G, Morgan RD, Osterman AL, Rodionov DA, Rodionova IA, Rudd KE, Söll D, Spain J, Xu SY, Bateman A, Blumenthal RM, Bollinger JM, Chang WS, Ferrer M, Friedberg I, Galperin MY, Gobeill J, Haft D, Hunt J, Karp P, Klimke W, Krebs C, Macelis D, Madupu R, Martin MJ, Miller JH, O'Donovan C, Palsson B, Ruch P, Setterdahl A, Sutton G, Tate J, Yakunin A, Tchigvintsev D, Plata G, Hu J, Greiner R, Horn D, Sjölander K, Salzberg SL, Vitkup D, Letovsky S, Segrè D, DeLisi C, Roberts RJ, Steffen M, Kasif S.
PLoS biology Volume 11 (2013) p.e1001638

DATABASE, The Journal of Biological Databases and Curation, is now the official journal of the International Society for Biocuration.
Gaudet P, Munoz-Torres M, Robinson-Rechavi M, Attwood T, Bateman A, Cherry JM, Kania R, O'Donovan C, Yamasaki C.
Database : the journal of biological databases and curation Volume 2013 (2013) p.bat077

ISCB computational biology Wikipedia competition.
Bateman A, Kelso J, Mietchen D, Macintyre G, Di Domenico T, Abeel T, Logan DW, Radivojac P, Rost B.
PLoS Computational Biology Volume 9 (2013) p.e1003242

The challenge of increasing Pfam coverage of the human proteome.
Mistry J, Coggill P, Eberhardt RY, Deiana A, Giansanti A, Finn RD, Bateman A, Punta M.
Database: The Journal of Biological Databases and Curation Volume 2013 (2013) p.

Genome of Acanthamoeba castellanii highlights extensive lateral gene transfer and early evolution of tyrosine kinase signaling.
Clarke M, Lohan AJ, Liu B, Lagkouvardos I, Roy S, Zafar N, Bertelli C, Schilde C, Kianianmomeni A, Bürglin TR, Frech C, Turcotte B, Kopec KO, Synnott JM, Choo C, Paponov I, Finkler A, Heng Tan CS, Hutchins AP, Weinmeier T, Rattei T, Chu JS, Gimenez G, Irimia M, Rigden DJ, Fitzpatrick DA, Lorenzo-Morales J, Bateman A, Chiu CH, Tang P, Hegemann P, Fromm H, Raoult D, Greub G, Miranda-Saavedra D, Chen N, Nash P, Ginger ML, Horn M, Schaap P, Caler L, Loftus BJ.
Genome biology Volume 14 (2013) p.R11

A comparison of dense transposon insertion libraries in the Salmonella serovars Typhi and Typhimurium.
Barquist L, Langridge GC, Turner DJ, Phan MD, Turner AK, Bateman A, Parkhill J, Wain J, Gardner PP.
Nucleic acids research Volume 41 (2013) p.4549-4564

Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions.
Mistry J, Finn RD, Eddy SR, Bateman A, Punta M.
Nucleic Acids Research Volume 41 (2013) p.e121

The challenge of increasing Pfam coverage of the human proteome.
Mistry J, Coggill P, Eberhardt RY, Deiana A, Giansanti A, Finn RD, Bateman A, Punta M.
Database : the journal of biological databases and curation Volume 2013 (2013) p.bat023

Alternative splicing of intrinsically disordered regions and rewiring of protein interactions.
Buljan M, Chalancon G, Dunker AK, Bateman A, Balaji S, Fuxreiter M, Babu MM.
Current opinion in structural biology Volume 23 (2013) p.443-450

2012

Tissue-specific splicing of disordered segments that embed binding motifs rewires protein interaction networks.
Buljan M, Chalancon G, Eustermann S, Wagner GP, Fuxreiter M, Bateman A, Babu MM.
Molecular cell Volume 46 (2012) p.871-883

The Pfam protein families database.
Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J, Heger A, Holm L, Sonnhammer EL, Eddy SR, Bateman A, Finn RD.
Nucleic acids research Volume 40 (2012) p.D290-301

Bioimage informatics: a new category in Bioinformatics.
Peng H, Bateman A, Valencia A, Wren JD.
Bioinformatics (Oxford, England) Volume 28 (2012) p.1057

InterPro in 2011: new developments in the family and domain prediction database.
Hunter S, Jones P, Mitchell A, Apweiler R, Attwood TK, Bateman A, Bernard T, Binns D, Bork P, Burge S, de Castro E, Coggill P, Corbett M, Das U, Daugherty L, Duquenne L, Finn RD, Fraser M, Gough J, Haft D, Hulo N, Kahn D, Kelly E, Letunic I, Lonsdale D, Lopez R, Madera M, Maslen J, McAnulla C, McDowall J, McMenamin C, Mi H, Mutowo-Muellenet P, Mulder N, Natale D, Orengo C, Pesseat S, Punta M, Quinn AF, Rivoire C, Sangrador-Vegas A, Selengut JD, Sigrist CJ, Scheremetjew M, Tate J, Thimmajanarthanan M, Thomas PD, Wu CH, Yeats C, Yong SY.
Nucleic Acids Research Volume 40 (2012) p.D306-12

MEROPS: the database of proteolytic enzymes, their substrates and inhibitors.
Rawlings ND, Barrett AJ, Bateman A.
Nucleic acids research Volume 40 (2012) p.D343-50

Making your database available through Wikipedia: the pros and cons.
Finn RD, Gardner PP, Bateman A.
Nucleic acids research Volume 40 (2012) p.D9-12

Biocurators and biocuration: surveying the 21st century challenges.
Burge S, Attwood TK, Bateman A, Berardini TZ, Cherry M, O'Donovan C, Xenarios L, Gaudet P.
Database : the journal of biological databases and curation Volume 2012 (2012) p.bar059

AntiFam: a tool to help identify spurious ORFs in protein annotation.
Eberhardt RY, Haft DH, Punta M, Martin M, O'Donovan C, Bateman A.
Database : the journal of biological databases and curation Volume 2012 (2012) p.bas003

The YARHG domain: an extracellular domain in search of a function.
Coggill P, Bateman A.
PloS one Volume 7 (2012) p.e35575

InterPro in 2011: new developments in the family and domain prediction database.
Hunter S, Jones P, Mitchell A, Apweiler R, Attwood TK, Bateman A, Bernard T, Binns D, Bork P, Burge S, de Castro E, Coggill P, Corbett M, Das U, Daugherty L, Duquenne L, Finn RD, Fraser M, Gough J, Haft D, Hulo N, Kahn D, Kelly E, Letunic I, Lonsdale D, Lopez R, Madera M, Maslen J, McAnulla C, McDowall J, McMenamin C, Mi H, Mutowo-Muellenet P, Mulder N, Natale D, Orengo C, Pesseat S, Punta M, Quinn AF, Rivoire C, Sangrador-Vegas A, Selengut JD, Sigrist CJA, Scheremetjew M, Tate J, Thimmajanarthanan M, Thomas PD, Wu CH, Yeats C, Yong S.
Nucleic Acids Research Volume 40 (2012) p.4725-4725

Recent advances in biocuration: meeting report from the Fifth International Biocuration Conference.
Gaudet P, Arighi C, Bastian F, Bateman A, Blake JA, Cherry MJ, D'Eustachio P, Finn R, Giglio M, Hirschman L, Kania R, Klimke W, Martin MJ, Karsch-Mizrachi I, Munoz-Torres M, Natale D, O'Donovan C, Ouellette F, Pruitt KD, Robinson-Rechavi M, Sansone SA, Schofield P, Sutton G, Van Auken K, Vasudevan S, Wu C, Young J, Mazumder R.
Database : the journal of biological databases and curation Volume 2012 (2012) p.bas036