Semantic Enrichment of the Scientific Literature Workshop
30, 31 March and 1 April 2009
Dear Associates,
We would like to invite you to the EBI workshop on Semantic Enrichment of the Scientific Literature which is co-organised and sponsored by the BootStrep project (contact Dietrich Rebholz-Schuhmann, rebholz@ebi.ac.uk) and the EBI Industry Programme (contact Dominic Clark, clark@ebi.ac.uk).
The workshop venue is the Wellcome Trust Conference Centre, Hinxton, Cambridge, UK.
Motivation
Over the last 10 years, innovation has changed the ways in which scientific publications are gathered and delivered to the public. Since the start of the electronic era:
- publishers have moved from paper presentation of their content to electronic delivery;
- the US National Library of Medicine (NLM) has opened up its archives and delivered Medline abstracts to the public in electronic form;
- open access publishers have been making their content freely available;
- curation teams are increasingly working with the publishers to gather ever more data and provide it to the public and
- proposals have been made that authors should contribute more details to their manuscript (FEBS letter experiment).
These changes require novel ways to capture and deliver the content to the public and to exchange the content and the annotations between different sites. This has led to increased activities to capture more information from the authors, to align it with the bioinformatics data resources, to deliver the content as part of the scientific literature and to improve the interoperability between existing automatic systems for text processing and exploitation.
This workshop will focus on semantic enrichment of the scientific literature. To this end, workshop participants will have the opportunity to hear about and discuss solutions that capture information from the authors directly and that deliver documents with their annotations. Furthermore, we will discuss the needs of different user groups for the benefits from the scientific literature of the future, e.g. librarians, researchers, automatic text processing and data mining research community, ontologists, others.
Participants
The participants in this workshop are all users who profit from better information retrieval (e.g., librarians and information scientists supporting industrial researchers) and information provision (e.g., bioinformatics research community). In addition, members of the text mining research community, members of publishing companies and industrial users of scientific information, curation teams and teams working on ontological or terminological resources.
Intended Outcomes
The intended outcomes of the workshop are:
- to exchange views on the reuse of literature in all possible ways and to the benefit of all involved parties;
- to have a better understanding for the infrastructure requirements coming from the automatic gathering and distribution of semantically enriched text (open standards and connectivity);
- to exchange views on what contributions could come from the publishers with regards to better exploitation of the publicly available literature and
- to assess current solutions for the gathering of semantic details from the authors while writing their scientific manuscripts.
Programme Overview
The workshop programme has been divided into three separate days – each with its own focus.
The foci are as follows:
- March 30th: Reliable factual data from the literature based on ontological resources (open meeting, Francis Crick Auditorium). Presentations and discussions on advanced solutions to model ontological resources for gathering facts from the literature and for integration into a fact database: gene regulatory events as a working example.
- March 31st: Semantic Enrichment of the literature for the benefit of all users (open meeting, Francis Crick Auditorium). Existing solutions for the processing and standardization of annotations in the scientific literature: authoring solutions, access to data through publishers, requirements from curators and “information scientists” in pharmaceutical and other industrial companies.
- April 1st: Efficient exchange of scientific literature for automatic exploitation and reuse of information (participation by invitation only, James Watson Pavilion). This is a focussed discussion forum that will address a number of issues and opportunities with key stakeholder groups.
Acknowledgement
We are grateful to Ian Dix (AstraZeneca) and Ian Harrow (Pfizer) for their advice in the construction of the workshop programme. This workshop is sponsored by the EC STREP project ‘BootStrep’ (FP6-028099, www.bootstrep.org ) and by the industry program at the EBI (http://www.ebi.ac.uk/industry/ind-prog-index.html).
Agenda:
 |