Sciweavers

241 search results - page 37 / 49
» Detecting Co-Derivative Documents in Large Text Collections
Sort
View
CIKM
2008
Springer
14 years 11 months ago
Combining concept hierarchies and statistical topic models
Statistical topic models provide a general data-driven framework for automated discovery of high-level knowledge from large collections of text documents. While topic models can p...
Chaitanya Chemudugunta, Padhraic Smyth, Mark Steyv...
90
Voted
JCDL
2009
ACM
162views Education» more  JCDL 2009»
15 years 4 months ago
Supporting analysis of future-related information in news archives and the web
A lot of future-related information is available in news articles or Web pages. This information can however differ to large extent and may fluctuate over time. It is therefore di...
Adam Jatowt, Kensuke Kanazawa, Satoshi Oyama, Kats...
ICDAR
2005
IEEE
15 years 3 months ago
An Old Greek Handwritten OCR System
Recognition of handwritten manuscripts is essential for efficient content exploitation of the valuable Old Greek historical collections. In this paper, we focus on the problem of ...
Kostas Ntzios, Basilios Gatos, Ioannis Pratikakis,...
CIKM
2004
Springer
15 years 1 months ago
InfoAnalyzer: a computer-aided tool for building enterprise taxonomies
In this paper we study the problem of collecting training samples for building enterprise taxonomies. We develop a computer-aided tool named InfoAnalyzer, which can effectively as...
Li Zhang, Shixia Liu, Yue Pan, Liping Yang
RIAO
1994
14 years 11 months ago
An Association Thesaurus for Information Retrieval
Although commonly used in both commercial and experimental information retrieval systems, thesauri have not demonstrated consistent bene ts for retrieval performance, and it is di...
Bruce Croft, Jing Yufeng