Sciweavers

1261 search results - page 185 / 253
» Extracting Text from PostScript
WWW
2006
ACM
15 years 10 months ago
A probabilistic approach to spatiotemporal theme pattern mining on weblogs
Mining subtopics from weblogs and analyzing their spatiotemporal patterns have applications in multiple domains. In this paper, we define the novel problem of mining spatiotempora...
Qiaozhu Mei, Chao Liu 0001, Hang Su, ChengXiang Zh...
DOCENG
2006
ACM
15 years 3 months ago
Evaluating invariances in document layout functions
With the development of variable-data-driven digital presses, where each document printed is potentially unique, there is a need for pre-press optimization to identify material that...
Alexander J. Macdonald, David F. Brailsford, John ...
DEXAW
2003
IEEE
15 years 3 months ago
Ontology Based Semantic Similarity Comparison of Documents
In this work we consider ontologies as knowledge structures that specify terms, their properties and relations among them to enable knowledge extraction from texts. We represent o...
Vladimir A. Oleshchuk, Asle Pedersen
CHI
1999
ACM
15 years 2 months ago
The Reader's Helper: A Personalized Document Reading Environment
Over the last two centuries, reading styles have shifted away from the reading of documents from beginning to end and toward the skimming of documents in search of relevant inform...
Jamey Graham
DOCENG
2010
ACM
14 years 11 months ago
Picture detection in document page images
We present a method for picture detection in document page images, which can come from scanned or camera images, or rendered from electronic file formats. Our method uses OCR to s...
Patrick Chiu, Francine Chen, Laurent Denoue