Sciweavers

587 search results - page 65 / 118
» New Algorithms for Text Fingerprinting
Sort
View
ACL
2012
13 years 6 days ago
Labeling Documents with Timestamps: Learning from their Time Expressions
Temporal reasoners for document understanding typically assume that a document’s creation date is known. Algorithms to ground relative time expressions and order events often re...
Nathanael Chambers
VLDB
2002
ACM
161views Database» more  VLDB 2002»
14 years 9 months ago
Distributed Search over the Hidden Web: Hierarchical Database Sampling and Selection
Many valuable text databases on the web have non-crawlable contents that are "hidden" behind search interfaces. Metasearchers are helpful tools for searching over many s...
Panagiotis G. Ipeirotis, Luis Gravano
ICDE
2012
IEEE
227views Database» more  ICDE 2012»
13 years 7 days ago
Horizontal Reduction: Instance-Level Dimensionality Reduction for Similarity Search in Large Document Databases
—Dimensionality reduction is essential in text mining since the dimensionality of text documents could easily reach several tens of thousands. Most recent efforts on dimensionali...
Min-Soo Kim 0001, Kyu-Young Whang, Yang-Sae Moon
KDD
2006
ACM
141views Data Mining» more  KDD 2006»
15 years 10 months ago
Statistical entity-topic models
The primary purpose of news articles is to convey information about who, what, when and where. But learning and summarizing these relationships for collections of thousands to mil...
David Newman, Chaitanya Chemudugunta, Padhraic Smy...
DOCENG
2008
ACM
14 years 11 months ago
Satisficing scrolls: a shortcut to satisfactory layout
We present at a new approach to finding aesthetically pleasing page layouts. We do not aim to find an optimal layout, rather the aim is to find a layout which is not obviously wro...
Nathan Hurst, Kim Marriott