The IeXML project aims to standardize the annotation of named entities in the life science literature, so that annotated corpora can be exchanged independently of the underlying technology. Several efforts are currently under way to produce annotated corpora and to build tools for evaluating different approaches that already work with this standard. These efforts have shown the benefits of using a common representation for annotations.

If you are interested in taking part, you can be added to the project's existing Google group. Just write an email to textmining-

Semantic Enrichment of Biomedical Literature

References and relevant documentation

Rebholz-Schuhmann, D., Kirsch, H., Nenadic, G. (2006) IeXML: towards a framework for interoperability of text processing modules to improve annotation of semantic types in biomedical text. BioLINK, ISMB 2006, Fortaleza, Brazil.

The presentation at SMBM 2008 can be found here.

The annotation guideline for the Multi-tagged corpus can be found here.