Sciweavers

13 search results - page 2 / 3
ICML
2006
IEEE
14 years 5 months ago
Pachinko allocation: DAG-structured mixture models of topic correlations
Latent Dirichlet allocation (LDA) and other related topic models are increasingly popular tools for summarization and manifold discovery in discrete data. However, LDA does not ca...
Wei Li, Andrew McCallum
KDD
2005
ACM
14 years 5 months ago
On the use of linear programming for unsupervised text classification
We propose a new algorithm for dimensionality reduction and unsupervised text classification. We use mixture models as the underlying process that generates the corpus and utilize a novel,...
Mark Sandler
IJON
2006
13 years 4 months ago
Improving self-organization of document collections by semantic mapping
In text management tasks, dimensionality reduction becomes necessary for the computation and interpretability of the results generated by machine learning algorithms. This paper de...
Renato Fernandes Corrêa, Teresa Bernarda Lud...
WWW
2008
ACM
14 years 5 months ago
Detecting image spam using visual features and near duplicate detection
Email spam is a much-studied topic, but even though current email spam detection software has been gaining a competitive edge against text-based email spam, new advances in spam g...
Bhaskar Mehta, Saurabh Nangia, Manish Gupta 0002, ...
DGO
2007
13 years 6 months ago
Identifying and classifying subjective claims
To understand subjective documents, for example public comments on the government’s proposed regulation, opinion identification and classification are required. Rather than ...
Namhee Kwon, Liang Zhou, Eduard H. Hovy, Stuart W....