Sciweavers

139 search results - page 23 / 28
» An Empirical Comparison of Four Text Mining Methods
Sort
View
KDD
2006
ACM
165views Data Mining» more  KDD 2006»
16 years 1 days ago
Training linear SVMs in linear time
Linear Support Vector Machines (SVMs) have become one of the most prominent machine learning techniques for highdimensional sparse data commonly encountered in applications like t...
Thorsten Joachims
SDM
2008
SIAM
177views Data Mining» more  SDM 2008»
15 years 1 months ago
Cluster Ensemble Selection
This paper studies the ensemble selection problem for unsupervised learning. Given a large library of different clustering solutions, our goal is to select a subset of solutions t...
Xiaoli Z. Fern, Wei Lin
101
Voted
DAWAK
2008
Springer
15 years 1 months ago
Adapting LDA Model to Discover Author-Topic Relations for Email Analysis
Analyzing the author and topic relations in email corpus is an important issue in both social network analysis and text mining. The AuthorTopic model is a statistical model that id...
Liqiang Geng, Hao Wang, Xin Wang, Larry Korba
SDM
2012
SIAM
216views Data Mining» more  SDM 2012»
13 years 2 months ago
Feature Selection "Tomography" - Illustrating that Optimal Feature Filtering is Hopelessly Ungeneralizable
:  Feature Selection “Tomography” - Illustrating that Optimal Feature Filtering is Hopelessly Ungeneralizable George Forman HP Laboratories HPL-2010-19R1 Feature selection; ...
George Forman
WWW
2006
ACM
16 years 9 days ago
Robust web content extraction
We present an empirical evaluation and comparison of two content extraction methods in HTML: absolute XPath expressions and relative XPath expressions. We argue that the relative ...
Marek Kowalkiewicz, Maria E. Orlowska, Tomasz Kacz...