Sciweavers

Knowledge-based data mining
ICDM
2005
IEEE
Summarization - Compressing Data into an Informative Representation
In this paper, we formulate the problem of summarization of a dataset of transactions with categorical attributes as an optimization problem involving two objective functions - co...
Varun Chandola, Vipin Kumar
PAKDD
2005
ACM
An Incremental Data Stream Clustering Algorithm Based on Dense Units Detection
The data stream model of computation is often used for analyzing huge volumes of continuously arriving data. In this paper, we present a novel algorithm called DUCstream ...
Jing Gao, Jianzhong Li, Zhaogong Zhang, Pang-Ning ...
KDD
1998
ACM
Probabilistic Modeling for Information Retrieval with Unsupervised Training Data
We apply a well-known Bayesian probabilistic model to textual information retrieval: the classification of documents based on their relevance to a query. This model was previously...
Ernest P. Chan, Santiago Garcia, Salim Roukos
KDD
2000
ACM
IntelliClean: A Knowledge-Based Intelligent Data Cleaner
Existing data cleaning methods work on the basis of computing the degree of similarity between nearby records in a sorted database. High recall is achieved by accepting records wi...
Mong-Li Lee, Tok Wang Ling, Wai Lup Low
GRC
2008
IEEE
MovStream: An Efficient Algorithm for Monitoring Clusters Evolving in Data Streams
Monitoring cluster evolution in data streams is a major research topic in data stream mining. Previous clustering methods for evolving data streams focus on global clustering res...
Liang Tang, Chang-jie Tang, Lei Duan, Chuan Li, Ye...