Sciweavers

IPM
2007
101views more  IPM 2007»
13 years 4 months ago
Decisions in thesaurus construction and use
A thesaurus and an ontology provide a set of structured terms, phrases, and metadata, often in a hierarchical arrangement, that may be used to index, search, and mine documents. W...
Robert M. Losee
IPM
2007
106views more  IPM 2007»
13 years 4 months ago
Patent document categorization based on semantic structural information
The number of patent documents is currently rising rapidly worldwide, creating the need for an automatic categorization system to replace time-consuming and labor-intensive manual...
Jae-Ho Kim, Key-Sun Choi
IPM
2007
149views more  IPM 2007»
13 years 4 months ago
Web page title extraction and its application
This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...
Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shu...
IPM
2007
83views more  IPM 2007»
13 years 4 months ago
Information retrieval performance measures for a current awareness report composition aid
This papers studies a special “small” information retrieval problem where user satisfaction only depends on the ordering of documents. We look for a retrieval performance meas...
Thomas Krichel
IPM
2007
143views more  IPM 2007»
13 years 4 months ago
QCS: A system for querying, clustering and summarizing documents
Information retrieval systems consist of many complicated components. Research and development of such systems is often hampered by the difficulty in evaluating how each particula...
Daniel M. Dunlavy, Dianne P. O'Leary, John M. Conr...
IPM
2007
118views more  IPM 2007»
13 years 4 months ago
Cluster-based patent retrieval
Through the recent NTCIR workshops, patent retrieval casts many challenging issues to information retrieval community. Unlike newspaper articles, patent documents are very long an...
In-Su Kang, Seung-Hoon Na, Jungi Kim, Jong-Hyeok L...
IPM
2007
84views more  IPM 2007»
13 years 4 months ago
Multi-candidate reduction: Sentence compression as a tool for document summarization tasks
This article examines the application of two single-document sentence compression techniques to the problem of multi-document summarization—a “parse-and-trim” approach and a...
David M. Zajic, Bonnie J. Dorr, Jimmy J. Lin, Rich...
IPM
2007
105views more  IPM 2007»
13 years 4 months ago
Schema and constraints-based matching and merging of Topic Maps
In this paper, we propose a multi-strategic matching and merging approach to find correspondences between ontologies based on the syntactic or semantic characteristics and constr...
Jung-Min Kim, Hyopil Shin, Hyoung-Joo Kim
IPM
2007
114views more  IPM 2007»
13 years 4 months ago
s-grams: Defining generalized n-grams for information retrieval
For European languages, n-gram has proved to be the cost effective alternative to morphological processing during indexing task and it has been studied and analyzed extensively us...
Anni Järvelin, Antti Järvelin, Kalervo J...
IPM
2007
123views more  IPM 2007»
13 years 4 months ago
Generating gene summaries from biomedical literature: A study of semi-structured summarization
Most knowledge accumulated through scientific discoveries in genomics and related biomedical disciplines is buried in the vast amount of biomedical literature. Since understandin...
Xu Ling, Jing Jiang, Xin He, Qiaozhu Mei, Chengxia...