Search Sciweavers | Sciweavers

141

ICDM
2002
IEEE

191views Data Mining» more ICDM 2002»

Iterative Clustering of High Dimensional Text Data Augmented by Local Search

15 years 9 months ago

The k-means algorithm with cosine similarity, also known as the spherical k-means algorithm, is a popular method for clustering document collections. However, spherical k-means ca...

Inderjit S. Dhillon, Yuqiang Guan, J. Kogan

claim paper

Read More »

129

click to vote

AUSDM
2006
Springer

137views Data Mining» more AUSDM 2006»

A Study of Local and Global Thresholding Techniques in Text Categorization

15 years 8 months ago

Download crpit.com

Feature Filtering is an approach that is widely used for dimensionality reduction in text categorization. In this approach feature scoring methods are used to evaluate features le...

Nayer M. Wanas, Dina A. Said, Nevin M. Darwish, Na...

claim paper

Read More »

167

click to vote

LWA
2008

220views Software Engineering» more LWA 2008»

Rule-Based Information Extraction for Structured Data Acquisition using TextMarker

15 years 5 months ago

Download ki.informatik.uni-wuerzburg.de

Information extraction is concerned with the location of specific items in (unstructured) textual documents, e.g., being applied for the acquisition of structured data. Then, the ...

Martin Atzmüller, Peter Klügl, Frank Pup...

claim paper

Read More »

142

click to vote

SDM
2008
SIAM

133views Data Mining» more SDM 2008»

Semantic Smoothing for Bayesian Text Classification with Small Training Data

15 years 5 months ago

Download www.cis.drexel.edu

Bayesian text classifiers face a common issue which is referred to as data sparsity problem, especially when the size of training data is very small. The frequently used Laplacian...

Xiaohua Zhou, Xiaodan Zhang, Xiaohua Hu

claim paper

Read More »

109

click to vote

PKDD
2007
Springer

86views Data Mining» more PKDD 2007»

An Effective Approach to Enhance Centroid Classifier for Text Categorization

15 years 10 months ago

Download www.searchforum.org.cn

Centroid Classifier has been shown to be a simple and yet effective method for text categorization. However, it is often plagued with model misfit (or inductive bias) incurred by i...

Songbo Tan, Xueqi Cheng

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers