KDD
2002
ACM
Scalable robust covariance and correlation estimates for data mining
Covariance and correlation estimates have important applications in data mining. In the presence of outliers, classical estimates of covariance and correlation matrices are not re...
Fatemah A. Alqallaf, Kjell P. Konis, R. Douglas Ma...
KDD
2002
ACM
MARK: a boosting algorithm for heterogeneous kernel models
Kristin P. Bennett, Michinari Momma, Mark J. Embre...
KDD
2002
ACM
Exploiting unlabeled data in ensemble methods
An adaptive semi-supervised ensemble method, ASSEMBLE, is proposed that constructs classification ensembles based on both labeled and unlabeled data. ASSEMBLE alternates between a...
Kristin P. Bennett, Ayhan Demiriz, Richard Maclin
KDD
2002
ACM
A theoretical framework for learning from a pool of disparate data sources
Shai Ben-David, Johannes Gehrke, Reba Schuller
KDD
2002
ACM
Frequent term-based text clustering
Text clustering methods can be used to structure large sets of text or hypertext documents. The well-known methods of text clustering, however, do not really address the special p...
Florian Beil, Martin Ester, Xiaowei Xu
KDD
2002
ACM
Collaborative crawling: mining user experiences for topical resource discovery
The rapid growth of the world wide web has made the problem of topic specific resource discovery an important one in recent years. In this problem, it is desired to find web pages wh...
Charu C. Aggarwal
KDD
2002
ACM
On effective classification of strings with wavelets
In recent years, the technological advances in mapping genes have made it increasingly easy to store and use a wide variety of biological data. Such data are usually in the form o...
Charu C. Aggarwal
KDD
2002
ACM
Topics in 0-1 data
Large 0-1 datasets arise in various applications, such as market basket analysis and information retrieval. We concentrate on the study of topic models, aiming at results which in...
Ella Bingham, Heikki Mannila, Jouni K. Seppän...
KDD
2002
ACM
Sequential PAttern mining using a bitmap representation
We introduce a new algorithm for mining sequential patterns. Our algorithm is especially efficient when the sequential patterns in the database are very long. We introduce a novel...
Jay Ayres, Jason Flannick, Johannes Gehrke, Tomi Y...
KDD
2002
ACM
Relational Markov models and their application to adaptive web navigation
Relational Markov models (RMMs) are a generalization of Markov models where states can be of different types, with each type described by a different set of variables. The domain ...
Corin R. Anderson, Pedro Domingos, Daniel S. Weld