spacer

Rebholz Group

The group focuses on extraction of facts from scientific literature in molecular biology. This is mainly based but not limited to matching of language patterns. In addition we do research on the disambiguation of semantic types, e.g. proteins, genes, species, drugs, and on automatic methods to identify language patterns. Both approaches require machine learning expertise and domain knowledge. The group has strong experience in Natural Language Processing methods (NLP) in the biomedical domain, and has applied its methods to different problems, e.g. identification of protein-protein interactions, extraction of acronyms and identification of mutations. All solutions are implemented as stand-alone modules, which can be combined and used to process text data in a pipeline. Existing modules are made available to the public through EBI's central services ('Whatizit', now as SOAP Web Service).

In 2005 the team has developed a special information retrieval engine ('EBIMed'), which processes Medline abstracts and delivers the analysis to the user for fast exploration. During the analysis terms from a large set of terminologies, e.g. UniprotKB/Swiss-Prot protein names, gene ontology terms, drugs and species, are identified. In addition acronyms referring to protein names are disambiguated. This solution shows, how information retrieval and information extraction can be combined, how pattern matching and disambiguation modules can be combined in a pipeline of modules, and how research work contributes to a powerful solution for daily use.

Currently the group is involved in a number of initiatives that prepare text mining solutions for their use in the public (refer to Whatizit). One initiative started at the ISMB 2006 in Fortaleza, Brasil, and focuses on the development of solutions that eases the flow of information provided through the scientific literature into other electronic data repositories.

Rebholz group is headed by Dietrich Rebholz-Schuhmann.

For Text Mining Support please go to: http://www.ebi.ac.uk/Rebholz-srv/
For enquiries, problems with our software and membership to mailing lists, please email to: textmining-support@ebi.ac.uk.

Date: 19/20 April, 2010
Venue: European Bioinformatics Institute, Hinxton, Cambridge, U.K.
Registration: Register here.

At the CALBC workshop, participants will discuss the outcome of the challenge. The CALBC project partner will explain in detail previous work on the corpus and will present the results from the challenge. Participants will present their work to meet the demands of the challenge ... more

Date: 21/22 April 2010
Venue: European Bioinformatics Institute, Hinxton, Cambridge, U.K.

The European Bioinformatics Institute (EBI) and the National Centre for Text Mining (University of Manchester) are organising a joint training event at the EBI. The purpose of this event is to teach basic techniques in information retrieval (IR) and information extraction (IE) in the biomedical domain and to give hands-on training on existing solutions provided by the two centres ... more

Date: 25th -26th October, 2010
Venue: EBI, Hinxton, Cambridge, UK
The Fourth Symposium on Semantic Mining in Biomedicine (SMBM 2010) will be held at the European Bioinformatics Institute (EBI) in Hinxton, Cambridgeshire, UK on October 25th and 26th, 2010. SMBM 2010 aims to bring together researchers from text and data mining in biomedicine, medical, bio- and chemoinformatics, and researchers from biomedical ontology design and engineering ...more
spacer
spacer