spacer

Literature Services - Internships: About CiteXplore

CiteXplore was launched recently with the aim to unite disparate literature ressources from the biomedical domain under one roof and so make the daily work of the biomedical researchers more productive and efficient. One of the resources we aim to integrate are the biomedical papers freely available on the net. This is complex undertaking including harvesting, transformation from a binary format (PDF) into a text-based one (HTML), identification and extraction of relevant metadata (paper title, authors, journal, etc.). One of the desired effects of this work is to allow us to create a citation map of the literature in the biomedical domain. Such a map would facilitate the navigation through the topically-related papers and would allow us to improve the sorting of the results by the citeXplore search engine (using an algorithm like PageRank or HIT).
spacer
spacer