Sciweavers

106 search results - page 8 / 22
» Document Representation and Dimension Reduction for Text Clu...
ICDM
2009
IEEE
14 years 7 months ago
SISC: A Text Classification Approach Using Semi Supervised Subspace Clustering
Text classification poses some specific challenges. One such challenge is its high dimensionality, where each document (data point) contains only a small subset of the dimensions. In this pap...
Mohammad Salim Ahmed, Latifur Khan
ACL
2012
12 years 12 months ago
A Novel Burst-based Text Representation Model for Scalable Event Detection
Mining retrospective events from text streams has been an important research topic. The classic text representation model (i.e., the vector space model) cannot model temporal aspects of d...
Xin Zhao, Rishan Chen, Kai Fan, Hongfei Yan, Xiaom...
IRI
2007
IEEE
15 years 3 months ago
Enhancing Text Analysis via Dimensionality Reduction
Many applications require analyzing vast amounts of textual data, but the size and inherent noise of such data can make processing very challenging. One approach to these issues i...
David G. Underhill, Luke McDowell, David J. Marche...
ICONIP
1998
14 years 10 months ago
Automated Text Categorization Using Support Vector Machine
In this paper, we study the use of support vector machines in text categorization. Unlike other machine learning techniques, it allows easy incorporation of new documents into an e...
James Tin-Yau Kwok
ICPR
2008
IEEE
15 years 3 months ago
A robust technique for text extraction in mixed-type binary documents
A crucial preprocessing stage in applications such as OCR is text extraction from mixed-type documents. The present work, in contrast to most previous work, successfully addresses the pro...
Charalambos Strouthopoulos, Athanasios Nikolaidis