Sciweavers

506 search results - page 4 / 102
» Feature Selection for the Classification of Large Document C...
Sort
View
ICDM
2006
IEEE
193views Data Mining» more  ICDM 2006»
15 years 3 months ago
Feature Subset Selection on Multivariate Time Series with Extremely Large Spatial Features
Several spatio-temporal data collected in many applications, such as fMRI data in medical applications, can be represented as a Multivariate Time Series (MTS) matrix with m rows (...
Hyunjin Yoon, Cyrus Shahabi
ACL
2004
14 years 11 months ago
The Sentimental Factor: Improving Review Classification Via Human-Provided Information
Sentiment classification is the task of labeling a review document according to the polarity of its prevailing opinion (favorable or unfavorable). In approaching this problem, a m...
Philip Beineke, Trevor Hastie, Shivakumar Vaithyan...
CIKM
2003
Springer
15 years 2 months ago
Online duplicate document detection: signature reliability in a dynamic retrieval environment
As online document collections continue to expand, both on the Web and in proprietary environments, the need for duplicate detection becomes more critical. Few users wish to retri...
Jack G. Conrad, Xi S. Guo, Cindy P. Schriber
100
Voted
DAS
2010
Springer
15 years 1 months ago
Nearest neighbor based collection OCR
Conventional optical character recognition (OCR) systems operate on individual characters and words, and do not normally exploit document or collection context. We describe a Coll...
K. Pramod Sankar, C. V. Jawahar, Raghavan Manmatha
DAS
2006
Springer
15 years 1 months ago
Retrieval from Document Image Collections
Abstract. This paper presents a system for retrieval of relevant documents from large document image collections. We achieve effective search and retrieval from a large collection ...
A. Balasubramanian, Million Meshesha, C. V. Jawaha...