Sciweavers

532 search results - page 25 / 107
» Clustering Text Data Streams
Sort
View
113
Voted
KDD
2004
ACM
103views Data Mining» more  KDD 2004»
16 years 3 months ago
An objective evaluation criterion for clustering
We propose and test an objective criterion for evaluation of clustering performance: How well does a clustering algorithm run on unlabeled data aid a classification algorithm? The...
Arindam Banerjee, John Langford
110
Voted
SAINT
2003
IEEE
15 years 8 months ago
Bayesian Analysis of Online Newspaper Log Data
In this paper we address the problem of analyzing web log data collected at a typical online newspaper site. We propose a two-way clustering technique based on probability theory....
Hannes Wettig, Jussi Lahtinen, Tuomas Lepola, Petr...
117
Voted
ERCIMDL
1997
Springer
106views Education» more  ERCIMDL 1997»
15 years 7 months ago
Scalable Text Retrieval for Large Digital Libraries
It is argued that digital libraries of the future will contain terabyte-scale collections of digital text and that full-text searching techniques will be required to operate over c...
David Hawking
129
Voted
ICML
2010
IEEE
15 years 4 months ago
Budgeted Nonparametric Learning from Data Streams
We consider the problem of extracting informative exemplars from a data stream. Examples of this problem include exemplarbased clustering and nonparametric inference such as Gauss...
Ryan Gomes, Andreas Krause
KDD
2002
ACM
170views Data Mining» more  KDD 2002»
16 years 3 months ago
Enhanced word clustering for hierarchical text classification
In this paper we propose a new information-theoretic divisive algorithm for word clustering applied to text classification. In previous work, such "distributional clustering&...
Inderjit S. Dhillon, Subramanyam Mallela, Rahul Ku...