![]() |
Rebholz Group PublicationsThe following is a list of publications for the Rebholz group.2011The CALBC RDF Triple store: retrieval over large literature content.
BMC Bioinformatics (To appear) A common layer of interoperability for biomedical ontologies based on OWL EL. Bioinformatics.
2011 Feb 21. [Epub ahead of print] Ontology design patterns to disambiguate relations between genes and gene products in GENIA.
J Biomedical Semantics (To appear). Improving the extraction of complex regulatory events from scientific text by using ontology-based inference.
J Biomedical Semantics (To appear). UKPMC: a full text article resource for the life sciences.
Nucleic Acids Res. 39 (Database issue):D58-65. Epub 2010 Nov 9. Assessment of NER solutions against the first and second CALBC Silver Standard Corpus.
J Biomedical Semantics (To appear).
2010"Interoperability between phenotype and anatomy ontologies."
Bioinformatics. 2010 Oct 22. "Relations as patterns: bridging the gap between OBO and OWL."
BMC Bioinformatics. 2010 Aug 31;11:441. "Wrestling with biomedical research results: Language resources and literature analysis."
J Bioinform Comput Biol. 2010 Feb;8(1):129-130. "PaperMaker: validation of biomedical scientific publications."
Bioinformatics 26(7):982-4 "CALBC Silver Standard Corpus."
J Bioinform Comput Biol. 2010 Feb;8(1):163-79. The CALBC Silver Standard Corpus for Biomedical Named Entities: A Study in Harmonizing the Contributions from Four Independent Named Entity Taggers.
Proc. LREC 2010. "Automatic Text Analysis for Bioinformatics Knowledge Discovery".
In: "Knowledge-based Bioinformatics". Editors: A. Bajpai. (To appear) "Biomedical Semantics: the Hub for Biomedical Research 2.0."
J Biomed Semantics. 2010 Mar 31;1(1):1. "Measuring prediction capacity of individual verbs for the identification of protein interactions".
Journal of biomedical informatics, 43, 200-207. "Assessment of NER solutions against the first and second CALBC Silver Standard Corpus."
Proc. of SMBM 2010, Cambridge, U.K.
2009""Between proteins and phenotypes: annotation and interpretation of mutations".
BMC Bioinformatics, 10 (Suppl 8):I1. Verification of Uncurated Protein Annotations.
In Information Retrieval in Biomedicine: Natural Language Processing for Knowledge Integration. IGI Global Publishing.
How Feasible and Robust is the Automatic Extraction of Gene Regulation Events? A Cross-Method Evaluation under Lab and Real-Life Conditions.
BioNLP 2009, Bolder, Colorado.
Terminological cleansing for improved retrieval based on ontological terms, Exploiting Semantic Annotations in Information Retrieval
ESAIR 2009, Barcelona, Es.
Use of shared lexical resources for efficient ontological engineering.
BMC Bioinformatics (accepted for publication)
Exploitation of ontological resources for scientific literature analysis: searching genes and related diseases.
Journal IEEE Engineering in Medicine and Biology Society (EMBC, accepted for publication).
Ontology refinement for improved information retrieval.
Journal of Information Processing and Management (accepted for publication)
Terminological cleansing for improved information retrieval based on ontological terms.
Proc. WSDM 2009, Barcelona, Spain, (2009): p 6-14.
Annotation of protein residues based on a literature analysis: cross-validation against UniProtKB.
BMC Bioinformatics, Special issue (accepted for publication).
Using Biomedical Terminological Resources for Information Retrieval.
In Information Retrieval in Biomedicine: Natural Language Processing for Knowledge Integration. IGI Global Publishing. (In press)
MeSH Up: effective MeSH text classification for improved document retrieval.
Bioinformatics. (2009):1412-8. Epub 2009 Apr 17.
PMID: 19376821 2008Text Mining for Biology - the Way Forward: Opinions from Leading Scientists.
Genome Biology 9, no. SUPPL.
PMID: 18834498 Gene Regulation Ontology (GRO): Design Principles and Use Cases.
MIE, Stockholm, 26-29 May 2008
Towards Knowledge in the Cloud.
OTM 2008 Workshops including SEMELS, Monterrey, Mexico, Nov. 9-14, 2008, Proceedings Series: Lecture Notes in Computer Science, Subseries: Information Systems and Applications, incl. Internet/Web, and HCI , Vol. 5333, Meersman, Robert; Tari, Zahir; Herrero, Pilar (Eds.) 2008, XXXV, 1090 p., ISBN: 978-3-540-88874-1
Combining Evidence, Specificity, and Proximity towards the Normalization of Gene Ontology Terms in Text.
EURASIP Journal on Bioinformatics and Systems Biology, vol. 2008, Article ID 342746. doi:10.1155/2008/342746.
abstract
Integrating Protein-Protein Interactions and Text Mining for Protein Function Prediction.
BMC Bioinformatics 9(8): S2; [Epub ahead of print]
PMID: 18673526
Use of shared lexical resources for efficient ontological engineering.
Semantic Web Applications and Tools for Life Sciences, 2008
Assessment of Disease Named Entity Recognition on a Corpus of Annotated Sentences.
BMC Bioinformatics 9, no. SUPPL. 3 (2008): Article S3.
PMID: 18426548MedEvi: Retrieving textual evidence of relations between biomedical concepts from Medline.
Bioinformatics 2008 Apr 9; [Epub ahead of print]
PMID: 18400773
Categorization of services for seeking information in biomedical literature: a typology for improvement of practice.
Brief Bioinform. 2008 Jul 26; [Epub ahead of print]
PMID: 18660511
Annotation of protein residues based on a literature analysis: cross-validation against UniProtKB Proc.
ECCB 2008 Workshop: Annotation, Interpretation and Management of Mutations (AIMM), Cagliari, Sardinia, Italy, September, June 22, 2008. (Published on CEUR-WS: 17-Dec-2008)
Static Dictionary Features for Term Polysemy Identification.
Proc Lang Res Eval, Conf (LREC-2008), workshop on "Building and evaluating resources for biomedical text mining", Marrakech (Morocco), 28-30 May 2008 (accepted).
MedEvi - A permuted concordancer for the biomedical domain.
Corpus Linguistics, Computer Tools, and Applications - State of the Art (PALC 2007), edited by B. Lewandowska-Tomaszczyk, Peter Lang, pp. 85-96, 2008.
Measuring performance of verbs denoting modifying and non-modifying protein interactions.
In Proceedings of the Third International Symposium on Semantic Mining in Biomedicine (SMBM 2008), Turku, Finland. Ed. Salakoski T., Rebholz-Schuhmann D., Pyysalo S. pp. 109--116. Turku Centre for Computer Science (TUCS).
Text Processing through Web Services: Calling Whatizit.
Bioinformatics 24, no. 2 (2008): 296-98.
PMID: 18006544 BioLexicon: A Lexical Resource for the Biology Domain.
In Proceedings of the Third International Symposium on Semantic Mining in Biomedicine (SMBM 2008), Turku, Finland. Ed. Salakoski T., Rebholz-Schuhmann D., Pyysalo S. pp. 109--116. Turku Centre for Computer Science (TUCS).
Facilitating the Development of Controlled Vocabularies for Metabolomics Technologies with Text Mining.
BMC Bioinformatics 9, no. SUPPL. 5 (2008): Article S5.
PMID: 18460187 2007Medical informatics and bioinformatics: a bibliometric study.
IEEE Trans Inf Technol Biomed. 11(3): 237-43.
PMID: 17521073
Assessment of diseases named entity recognition on a corpus of annotated sentences.
Proc of LBM 2007, Singpore, Dec 2007 (accepted for publication in BMC Bioinformatics)
Information Retrieval and Information Extraction in TREC Genomics 2007.
TREC Genomics competition, Washington, U.S.A. (Nov 2007) Text processing through Web services: Calling Whatizit.
Bioinformatics 2007 24(2): 296-298.
abstract EBIMed – Text crunching to gather facts for proteins from Medline.
Bioinformatics 23(2): e237-e244.
abstract
Facilitating the development of controlled vocabularies for metabolomics with text mining, in ISMB/ECCB
Special Interest Group (SIG) Meeting Program Materials, Bio-Ontologies SIG Workshop, Vienna, Austria, pp. 103-106 (accepted for publication in BMC Bioinformatics)
2006Dealing with repetitions in sequencing by hybridization.
J Comp Biol Chem. 2006 Oct;30(5):313-20. Epub 2006 Aug 30
PMID: 16945587
GOAnnotator: linking protein GO annotations to evidence text.
J Biomed Discov Collab. Dec 20;1(1):19
PMID: 17181854
Distributed modules for text annotation and IE applied to the biomedical domain.
Int J Med Inform. 75(6), 496-500.
PMID: 16085453
SYMBiomatics: Synergies in Medical Informatics and Bioinformatics – exploring current scientific literature for emerging topics.
BITS 2006, Bologna.
Annotation and Disambiguation of Semantic Types in Biomedical Text: a Cascaded Approach to Named Entity Recognition.
Workshop on multidimensional markup with Xml (XMLNLP), EACL 2006, Trente, Italy.
IeXML: towards a framework for interoperability of text processing modules to improve annotation of semantic types in biomedical text.
BioLINK, ISMB 2006, Fortaleza, Brazil.
Using argumentation to extract key sentences from biomedical abstracts.
Int J Med Inform. 76(2-3): 195-200.
PMID: 16815739
2005Resolving abbreviations to their senses in Medline.
Bioinformatics 21(18):3658-64
PMID: 16037121
LLL’05 Challenge: Genic Interaction Extraction with Alignments and Finite State Automata.In 4th Learning Language in Logic Workshop (LLL05) at the 22nd Int
Conf on Machine Learning (James Cussens, Claire Nédellec), 38-45
Stud Health Technol Inform. 116:835-40.
more info
Facts from text-is text mining ready to deliver?
PLos Biol., 3 (2): e65.
PMID: 15719064
Facts from Text – information extraction online. In Dagstuhl Seminar 05441 “Managing and Mining Genome Information:Frontiers in Bioinformatics (Blazewicz, J., et al., eds)
Int J Med Inform. 75(6), 496-500.
more Info
Extracting key sentences with latent argumentative structuring.
Stud Health Technol Inform. 116:835-40.
PMID: 16160362
Protein annotation by EBIMed
Nature Biotechnology, 24(8), 902-903.
PMID: 16900125
2004Extraction of biomedical facts - a modular Web server at the EBI (Whatizit).
Proceedings HDL 2004, Bath, UK.
PMID: 12738764
Automatic extraction of mutations from Medline and cross-validation with OMIM.
Nucleic Acids Res, 32(1):135--142.
Stud Health Technol Inform. 116:835-40.
PMID: 14704350
2003Computer-assisted generation of a protein-interaction database for nuclear receptors.
J. Mol. Endocrinol., 17 (8): 1555-67.
PMID: 12738764
Workshop ProceedingsExtraction of biomedical facts - a modular Web server at the EBI (Whatizit).
Proceedings HDL 2004, Bath, UK.
PMID: 12738764
Selected Poster PresentationsWhatizit - A Server Pipeline on a Linux cluster for Text Annotation in the Biomedical Domain.
ISMB 2004, Glasgow, UK.
Link: Poster@ISMB 2004
![]() |