Claire O’Donovan

Claire O’Donovan

Team Leader, UniProt content

BSc (Hons) in Biochemistry, 1992, University College Cork, Ireland. Diploma in Computer Science, 1993, University College Cork, Ireland. At EMBL since 1993, at EMBL-EBI since 1994. Team Leader since 2009.

Tel:+44 (0)1223 494 460 / Fax:+44 (0)1223 494 468

O'Donovan team

The central activity of Claire O'Donovan's team is the biocuration of our UniProt databases.

Biocuration involves the interpretation and integration of information relevant to biology into a database or resource that enables integration of the scientific literature as well as large data sets. Accurate and comprehensive representation of biological knowledge, as well as easy access to this data for working scientists and a basis for computational analysis, are primary goals of biocuration.

UniProt manual curation: Manual curation involves a critical review of experimental and predicted data for each protein, as well as manual verification of each protein sequence. The curation methods we apply to UniProtKB/Swiss-Prot include manual extraction and structuring of experimental information from the literature, manual verification of results from computational analyses, quality assessment mining and integration of large-scale data sets and continuous updating as new information becomes available.

UniProt automatic annotation: UniProt has developed two complementary approaches in order to automatically annotate protein sequences with a high degree of accuracy. UniRule is a collection of manually curated annotation rules, which define annotations that can be propagated based on specific conditions. The Statistical Automatic Annotation System (SAAS) is an automatic decision-tree-based rule-generating system. The central components of these approaches are rules based on InterPro classification and the manually curated data in UniProtKB/Swiss-Prot from the experimental literature and InterPro classification.

UniProt GO annotation (GOA): The UniProt GO annotation (GOA) program aims to add high-quality GO annotations to proteins in the UniProt Knowledgebase (UniProtKB). The assignment of GO terms to UniProt records is an integral part of UniProt biocuration. We supplement UniProt manual and electronic GO annotations are supplemented with manual annotations supplied by external collaborating GO Consortium groups. This ensures that users have a comprehensive GO annotation dataset. UniProt-GOA is a member of the GO consortium.

Claire's team works in a fully complementary fashion with Maria-Jesus Martin's UniProt development group to provide essential resources to the biological community such that the databases have become an integral part of the tools researchers use on a daily basis for their work. The Universal Protein Resource (UniProt) is a comprehensive resource for protein sequence and functional annotation data. UniProt is comprised of four major components, each optimized for different uses. The UniProt Knowledgebase (UniProtKB) is an expertly curated database, a central access point for integrated protein information with cross-references to multiple sources.

The UniProt Archive (UniParc) is a comprehensive sequence repository, reflecting the history of all protein sequences. UniProt Reference Clusters (UniRef) merge closely related sequences based on sequence identity to speed up searches while the UniProt Metagenomic and Environmental Sequences database (UniMES) was created to respond to the expanding area of metagenomic data.

Publications

2013

Nucleic Acids Research. Volume 41, Number D1, (2013), p.D773–D780 doi:

2012

Database: the journal of biological databases and curation. Volume 2012, (2012), doi:
Database: The Journal of Biological Databases and Curation. Volume 2012, (2012), doi:

2011

Methods Mol. Biol. Volume 694, (2011), p.25–35 doi:

2010

2009

Bioinformatics. Volume 25, Number 22, (2009), p.3045–3046 doi:
Database: the journal of biological databases and curation. Volume 2009, (2009), doi:

2008

Nucleic acids research. Volume 36, Number Database issue, (2008), p.D1028 doi:

2007

2006

2005

Nucleic acids research. Volume 33, Number suppl 1, (2005), p.D154–D159
The Proteomics Protocols Handbook. (2005), p.609–618 doi:

2004

Nucleic acids research. Volume 32, Number suppl 1, (2004), p.D115–D119
Bioinformatics. Volume 20, Number 17, (2004), p.3236–3237
Genetic engineering. Volume 26, (2004), p.13 doi:

2003

Pharmacogenomics. Volume 4, Number 3, (2003), p.343–350 doi:

2002

Briefings in Bioinformatics. Volume 3, Number 3, (2002), p.275–284 doi:

2001

TRENDS in Biotechnology. Volume 19, Number 5, (2001), p.178–180 doi:

2000

1999

Bioinformatics. Volume 15, Number 3, (1999), p.258–259 doi:

1998

Glycoconjugate journal. Volume 15, Number 5, (1998), p.507–509 doi:

1997

1990

1985

Submitted

Services

Team members

Joanna Argasinska
Ramona Britto
Gayatri Chavali
Elena Cibrian-Uhalte
Amy Cottage
Paul Gane
Penelope Garmiri
Emma Hatton-Ellis
Reija Hieta
Rachael Huntley
Duncan Legge
Alistair MacDougall
Michele Magrane
Prudence Mutowo
Klemens Pichler
Lorna Richardson
Aleksandra Shypitsyna