Sciweavers

Search results: Technology of Text Mining
ICDM 2002, IEEE
Iterative Clustering of High Dimensional Text Data Augmented by Local Search
The k-means algorithm with cosine similarity, also known as the spherical k-means algorithm, is a popular method for clustering document collections. However, spherical k-means ca...
Inderjit S. Dhillon, Yuqiang Guan, J. Kogan
AUSDM 2006, Springer
A Study of Local and Global Thresholding Techniques in Text Categorization
Feature Filtering is an approach that is widely used for dimensionality reduction in text categorization. In this approach feature scoring methods are used to evaluate features le...
Nayer M. Wanas, Dina A. Said, Nevin M. Darwish, Na...
LWA 2008
Rule-Based Information Extraction for Structured Data Acquisition using TextMarker
Information extraction is concerned with the location of specific items in (unstructured) textual documents, e.g., being applied for the acquisition of structured data. Then, the ...
Martin Atzmüller, Peter Klügl, Frank Pup...
SDM 2008, SIAM
Semantic Smoothing for Bayesian Text Classification with Small Training Data
Bayesian text classifiers face a common issue which is referred to as data sparsity problem, especially when the size of training data is very small. The frequently used Laplacian...
Xiaohua Zhou, Xiaodan Zhang, Xiaohua Hu
PKDD 2007, Springer
An Effective Approach to Enhance Centroid Classifier for Text Categorization
Centroid Classifier has been shown to be a simple and yet effective method for text categorization. However, it is often plagued with model misfit (or inductive bias) incurred by i...
Songbo Tan, Xueqi Cheng
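
For readers unfamiliar with the baseline that the last entry builds on, here is a minimal sketch of a plain centroid classifier with cosine similarity. It is not the enhancement proposed by Tan and Cheng, whose details are cut off in the abstract above. The whitespace tokenizer, the raw term-frequency weighting, the function names, and the toy training data are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def vectorize(text):
    """Turn a text into a unit-length term-frequency vector (term -> weight)."""
    counts = Counter(text.lower().split())  # assumption: whitespace tokenization
    norm = math.sqrt(sum(c * c for c in counts.values()))
    return {t: c / norm for t, c in counts.items()} if norm else {}


def cosine(u, v):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0


def train_centroids(labeled_docs):
    """Sum the unit vectors of each class to obtain one centroid per class."""
    sums = defaultdict(lambda: defaultdict(float))
    for text, label in labeled_docs:
        for term, weight in vectorize(text).items():
            sums[label][term] += weight
    return {label: dict(vec) for label, vec in sums.items()}


def classify(text, centroids):
    """Assign the class whose centroid is most similar to the document."""
    doc = vectorize(text)
    return max(centroids, key=lambda label: cosine(doc, centroids[label]))


# Hypothetical toy corpus, only to make the sketch runnable.
TRAIN = [
    ("stocks fell as markets closed lower", "finance"),
    ("the bank raised interest rates", "finance"),
    ("the team won the final match", "sports"),
    ("the striker scored two goals", "sports"),
]
CENTROIDS = train_centroids(TRAIN)
```

Calling `classify("interest rates and markets", CENTROIDS)` compares the query against one summed vector per class instead of against every training document, which is what makes the method cheap at prediction time.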