Sciweavers

374 search results - page 26 / 75
» A generative pattern model for mining binary datasets
Sort
View
84
Voted
SIGIR
2010
ACM
15 years 1 months ago
Self-taught hashing for fast similarity search
The ability of fast similarity search at large scale is of great importance to many Information Retrieval (IR) applications. A promising way to accelerate similarity search is sem...
Dell Zhang, Jun Wang, Deng Cai, Jinsong Lu
ICDM
2006
IEEE
138views Data Mining» more  ICDM 2006»
15 years 3 months ago
Adding Semantics to Email Clustering
This paper presents a novel algorithm to cluster emails according to their contents and the sentence styles of their subject lines. In our algorithm, natural language processing t...
Hua Li, Dou Shen, Benyu Zhang, Zheng Chen, Qiang Y...
97
Voted
BIBM
2007
IEEE
171views Bioinformatics» more  BIBM 2007»
15 years 4 months ago
GenMiner: Mining Informative Association Rules from Genomic Data
GENMINER is a smart adaptation of closed itemsets based association rules extraction to genomic data. It takes advantage of the novel NORDI discretization method and of the CLOSE ...
Ricardo Martínez, Claude Pasquier, Nicolas ...
KDD
2002
ACM
171views Data Mining» more  KDD 2002»
15 years 10 months ago
Mining complex models from arbitrarily large databases in constant time
In this paper we propose a scaling-up method that is applicable to essentially any induction algorithm based on discrete search. The result of applying the method to an algorithm ...
Geoff Hulten, Pedro Domingos
KDD
2004
ACM
103views Data Mining» more  KDD 2004»
15 years 10 months ago
An objective evaluation criterion for clustering
We propose and test an objective criterion for evaluation of clustering performance: How well does a clustering algorithm run on unlabeled data aid a classification algorithm? The...
Arindam Banerjee, John Langford