Sciweavers

92 search results - page 11 / 19
» Motivations and Methods for Text Simplification
Sort
View
ACL
2010
14 years 7 months ago
Event-Based Hyperspace Analogue to Language for Query Expansion
Bag-of-words approaches to information retrieval (IR) are effective but assume independence between words. The Hyperspace Analogue to Language (HAL) is a cognitively motivated and...
Tingxu Yan, Tamsin Maxwell, Dawei Song, Yuexian Ho...
DAS
2010
Springer
14 years 7 months ago
Information extraction by finding repeated structure
Repetition of layout structure is prevalent in document images. In document design, such repetition conveys the underlying logical and functional structure of the data. For exampl...
Evgeniy Bart, Prateek Sarkar
DRR
2008
14 years 11 months ago
Word segmentation of off-line handwritten documents
Word segmentation is the most critical pre-processing step for any handwritten document recognition/retrieval system. This paper describes an approach to separate a line of uncons...
Chen Huang, Sargur N. Srihari
SIGIR
2002
ACM
14 years 9 months ago
Unsupervised document classification using sequential information maximization
We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the agglomerative IB algorithm, the new sequential ...
Noam Slonim, Nir Friedman, Naftali Tishby
ICDM
2005
IEEE
161views Data Mining» more  ICDM 2005»
15 years 3 months ago
Making Logistic Regression a Core Data Mining Tool with TR-IRLS
Binary classification is a core data mining task. For large datasets or real-time applications, desirable classifiers are accurate, fast, and need no parameter tuning. We presen...
Paul Komarek, Andrew W. Moore