Sciweavers

3152 search results - page 315 / 631
» Retrieval of Partial Documents
Sort
View
IPM
2007
106views more  IPM 2007»
15 years 5 months ago
Patent document categorization based on semantic structural information
The number of patent documents is currently rising rapidly worldwide, creating the need for an automatic categorization system to replace time-consuming and labor-intensive manual...
Jae-Ho Kim, Key-Sun Choi
145
Voted
PVLDB
2010
115views more  PVLDB 2010»
15 years 3 months ago
ROXXI: Reviving witness dOcuments to eXplore eXtracted Information
In recent years, there has been considerable research on information extraction and constructing RDF knowledge bases. In general, the goal is to extract all relevant information f...
Shady Elbassuoni, Katja Hose, Steffen Metzger, Ral...
138
Voted
SIGIR
2002
ACM
15 years 5 months ago
Cross-lingual relevance models
We propose a formal model of Cross-Language Information Retrieval that does not rely on either query translation or document translation. Our approach leverages recent advances in...
Victor Lavrenko, Martin Choquette, W. Bruce Croft
CIKM
2009
Springer
15 years 12 months ago
The impact of document structure on keyphrase extraction
Keyphrases are short phrases that reflect the main topic of a document. Because manually annotating documents with keyphrases is a time-consuming process, several automatic appro...
Katja Hofmann, Manos Tsagkias, Edgar Meij, Maarten...
119
Voted
JCDL
2005
ACM
100views Education» more  JCDL 2005»
15 years 11 months ago
Automatic extraction of titles from general documents using machine learning
In this paper, we propose a machine learning approach to title extraction from general documents. By general documents, we mean documents that can belong to any one of a number of...
Yunhua Hu, Hang Li, Yunbo Cao, Dmitriy Meyerzon, Q...