Sciweavers

287 search results - page 42 / 58
» Coactive Learning for Distributed Data Mining
Sort
View
KDD
2004
ACM
210views Data Mining» more  KDD 2004»
16 years 1 days ago
Probabilistic author-topic models for information discovery
We propose a new unsupervised learning technique for extracting information from large text collections. We model documents as if they were generated by a two-stage stochastic pro...
Mark Steyvers, Padhraic Smyth, Michal Rosen-Zvi, T...
115
Voted
SDM
2012
SIAM
216views Data Mining» more  SDM 2012»
13 years 2 months ago
Feature Selection "Tomography" - Illustrating that Optimal Feature Filtering is Hopelessly Ungeneralizable
:  Feature Selection “Tomography” - Illustrating that Optimal Feature Filtering is Hopelessly Ungeneralizable George Forman HP Laboratories HPL-2010-19R1 Feature selection; ...
George Forman
ICDM
2007
IEEE
289views Data Mining» more  ICDM 2007»
15 years 6 months ago
Latent Dirichlet Conditional Naive-Bayes Models
In spite of the popularity of probabilistic mixture models for latent structure discovery from data, mixture models do not have a natural mechanism for handling sparsity, where ea...
Arindam Banerjee, Hanhuai Shan
SDM
2010
SIAM
158views Data Mining» more  SDM 2010»
15 years 1 months ago
On the Use of Combining Rules in Relational Probability Trees
A relational probability tree (RPT) is a type of decision tree that can be used for probabilistic classification of instances with a relational structure. Each leaf of an RPT cont...
Daan Fierens
107
Voted
KDD
2004
ACM
158views Data Mining» more  KDD 2004»
16 years 1 days ago
A generalized maximum entropy approach to bregman co-clustering and matrix approximation
Co-clustering is a powerful data mining technique with varied applications such as text clustering, microarray analysis and recommender systems. Recently, an informationtheoretic ...
Arindam Banerjee, Inderjit S. Dhillon, Joydeep Gho...