Sciweavers

2228 search results - page 55 / 446
» Distributed Data Clustering Can Be Efficient and Exact
SIGMOD
2003
ACM
16 years 6 months ago
Robust and Efficient Fuzzy Match for Online Data Cleaning
To ensure high data quality, data warehouses must validate and cleanse incoming data tuples from external sources. In many situations, clean tuples must match acceptable tuples in...
Surajit Chaudhuri, Kris Ganjam, Venkatesh Ganti, R...
AUSDM
2007
Springer
16 years 7 days ago
Temporal Pattern Matching for the Prediction of Stock Prices
Time series data poses a significant variation to the traditional segmentation techniques of data mining because the observation is derived from multiple instances of the same und...
Richi Nayak, Paul te Braak
KDD
2003
ACM
16 years 6 months ago
Generative model-based clustering of directional data
High dimensional directional data is becoming increasingly important in contemporary applications such as analysis of text and gene-expression data. A natural model for multivaria...
Arindam Banerjee, Inderjit S. Dhillon, Joydeep Gho...
ICDE
2008
IEEE
16 years 7 months ago
Efficiently Answering Probabilistic Threshold Top-k Queries on Uncertain Data
(Extended Abstract) Ming Hua, Jian Pei (Simon Fraser University, Canada); Wenjie Zhang, Xuemin Lin (The University of New South Wales & NICTA). {mhua, jpei}@cs.sfu.c...
Ming Hua, Jian Pei, Wenjie Zhang, Xuemin Lin
CORR
2010
Springer
15 years 6 months ago
Data Stream Clustering: Challenges and Issues
Very large databases are required to store massive amounts of data that are continuously inserted and queried. Analyzing huge data sets and extracting valuable patterns in many appl...
Madjid Khalilian, Norwati Mustapha