Retrieval with gene queries

Background: Accurate document retrieval from MEDLINE for gene queries is crucial for many bioinformatics applications. We explore five information retrieval-based methods for ranking the documents retrieved by PubMed gene queries for the human genome. The aim is to rank relevant documents higher in the retrieved list. We address the special challenges posed by ambiguity in gene nomenclature: gene terms that refer to multiple genes, gene terms that are also English words, and gene terms that have other biological meanings.

Results: Our two baseline ranking strategies perform similarly. Two of our three LocusLink-based strategies offer significant improvements, and they work very well even when the gene terms are ambiguous. Our best ranking strategy offers significant improvements over both baseline strategies on all three kinds of ambiguity.
Aditya Kumar Sehgal, Padmini Srinivasan
Added 10 Dec 2010
Updated 10 Dec 2010
Type Journal
Year 2006