Sciweavers

HIKM
2006
ACM

Automatic document indexing in large medical collections

14 years 4 months ago
Automatic document indexing in large medical collections
Term extraction relates to extracting the most characteristic or important terms (words or phrases) in a document. This information is commonly used for improving the accuracy of document indexing and retrieval in large text collections. It also allows for faster and better understanding of the contents of a document collection without first browsing through the contents of its documents. This paper presents AMTEX , an automatic term extraction method, specifically designed for the automatic indexing of documents in large medical collections such as MEDLINE, the premier bibliographic database of the U.S. National Library of Medicine (NLM). AMTEX combines MeSH, the terminological thesaurus resource of NLM, with a well-established method for extraction of domain terms, the C/NC-value method. The performance evaluation of various AMTEX configurations in the indexing task is measured against the current state-of-the-art, the MMTx method. The experimental results on a subset of MEDLINE d...
Angelos Hliaoutakis, Kalliopi Zervanou, Euripides
Added 13 Jun 2010
Updated 13 Jun 2010
Type Conference
Year 2006
Where HIKM
Authors Angelos Hliaoutakis, Kalliopi Zervanou, Euripides G. M. Petrakis, Evangelos E. Milios
Comments (0)