Sciweavers

2228 search results - page 67 / 446
» Distributed Data Clustering Can Be Efficient and Exact
Sort
View
254
Voted
SIGMOD
2008
ACM
167views Database» more  SIGMOD 2008»
16 years 6 months ago
Efficient lineage tracking for scientific workflows
Data lineage and data provenance are key to the management of scientific data. Not knowing the exact provenance and processing pipeline used to produce a derived data set often re...
Thomas Heinis, Gustavo Alonso
142
Voted
WAIM
2009
Springer
16 years 21 days ago
SLICE: A Novel Method to Find Local Linear Correlations by Constructing Hyperplanes
Finding linear correlations in dataset is an important data mining task, which can be widely applied in the real world. Existing correlation clustering methods combine clustering w...
Liang Tang, Changjie Tang, Lei Duan, Yexi Jiang, J...
APWEB
2006
Springer
15 years 10 months ago
Generalized Projected Clustering in High-Dimensional Data Streams
Clustering is to identify densely populated subgroups in data, while correlation analysis is to find the dependency between the attributes of the data set. In this paper, we combin...
Ting Wang
ICARCV
2002
IEEE
92views Robotics» more  ICARCV 2002»
15 years 11 months ago
LTSD: a highly efficient symmetry-based robust estimator
Although the least median of squares (LMedS) method and the least trimmed squares (LTS) method are said to have a high breakdown point (50%), they can break down at unexpectedly l...
Hanzi Wang, David Suter
157
Voted
BMCBI
2010
100views more  BMCBI 2010»
15 years 6 months ago
Efficiency clustering for low-density microarrays and its application to QPCR
Background: Pathway-targeted or low-density arrays are used more and more frequently in biomedical research, particularly those arrays that are based on quantitative real-time PCR...
Eric F. Lock, Ryan Ziemiecki, J. S. Marron, Dirk P...