large data sets | Sciweavers

IPPS
2000
IEEE

PaDDMAS: Parallel and Distributed Data Mining Application Suite

13 years 9 months ago

Discovering complex associations, anomalies and patterns in distributed data sets is gaining popularity in a range of scientific, medical and business applications. Various algor...

Omer F. Rana, David W. Walker, Maozhen Li, Steven ...

IDEAS
2000
IEEE

Bulk Loading a Data Warehouse Built Upon a UB-Tree

13 years 9 months ago

Download mistral.in.tum.de

This paper considers the issue of bulk loading large data sets for the UB-Tree, a multidimensional index structure. Especially in data warehousing (DW), data mining and OLAP it is...

Robert Fenk, Akihiko Kawakami, Volker Markl, Rudol...

ICPR
2000
IEEE

Scaling-Up Support Vector Machines Using Boosting Algorithm

13 years 9 months ago

Download www.datalab.uci.edu

In recent years, support vector machines (SVMs) have been successfully applied to solve a large number of classification problems. Training an SVM, usually posed as a quadrati...

Dmitry Pavlov, Jianchang Mao, Byron Dom

DASFAA
2010
IEEE

Transitivity-Preserving Skylines for Partially Ordered Domains

13 years 9 months ago

Download www.itee.uq.edu.au

The skyline of a set P of multi-dimensional points (tuples) consists of those points in P for which no clearly better point in P exists, using component-wise comparison on domains ...

Henning Köhler, Kai Zheng, Jing Yang, Xiaofan...

IPPS
2002
IEEE

Predicting the Performance of Wide Area Data Transfers

13 years 9 months ago

Download www.cct.lsu.edu

As Data Grids become more commonplace, large data sets are being replicated and distributed to multiple sites, leading to the problem of determining which replica can be accessed ...

Sudharshan Vazhkudai, Jennifer M. Schopf, Ian T. F...

ICDM
2002
IEEE

O-Cluster: Scalable Clustering of Large High Dimensional Data Sets

13 years 9 months ago

Download www.dlsi.ua.es

Clustering large data sets of high dimensionality has always been a serious challenge for clustering algorithms. Many recently developed clustering algorithms have attempted to ad...

Boriana L. Milenova, Marcos M. Campos

IPPS
2003
IEEE

Simulation of Dynamic Data Replication Strategies in Data Grids

13 years 9 months ago

Download www.cs.rpi.edu

Data Grids provide geographically distributed resources for large-scale data-intensive applications that generate large data sets. However, ensuring efficient access to such huge...

Houda Lamehamedi, Zujun Shentu, Boleslaw K. Szyman...

KDD
2004
ACM

Programming the K-means clustering algorithm in SQL

13 years 10 months ago

Download www.cs.uiuc.edu

Using SQL has not been considered an efficient and feasible way to implement data mining algorithms. Although this is true for many data mining, machine learning and statistical a...

Carlos Ordonez

SSD
2005
Springer

Selectivity Estimation of High Dimensional Window Queries via Clustering

13 years 10 months ago

Download www.dbs.informatik.uni-muenchen.de

Query optimization is an important functionality of modern database systems and is often based on estimating the selectivity of queries before actually executing them. Well-...

Christian Böhm, Hans-Peter Kriegel, Peer Kr...

ISMDA
2005
Springer

Simultaneous Scheduling of Replication and Computation for Bioinformatic Applications on the Grid

13 years 10 months ago

Download graal.ens-lyon.fr

One of the first motivations for using grids comes from applications that manage large data sets, for example in High Energy Physics or the Life Sciences. To improve the glob...

Frederic Desprez, Antoine Vernois, Christophe Blan...
