Sciweavers

1031 search results - page 61 / 207
» Managing the operator ordering problem in parallel databases
Sort
View
123
Voted
IWCC
1999
IEEE
15 years 4 months ago
A High Performance Communication Subsystem for PODOS
PODOS is a performance oriented distributed operating system being developed to harness the performance capabilities of a cluster computing environment. In order to address the gr...
Sudharshan Vazhkudai, P. Tobin Maginnis
116
Voted
KDD
2009
ACM
198views Data Mining» more  KDD 2009»
16 years 1 months ago
Pervasive parallelism in data mining: dataflow solution to co-clustering large and sparse Netflix data
All Netflix Prize algorithms proposed so far are prohibitively costly for large-scale production systems. In this paper, we describe an efficient dataflow implementation of a coll...
Srivatsava Daruru, Nena M. Marin, Matt Walker, Joy...
103
Voted
SIGMOD
2008
ACM
100views Database» more  SIGMOD 2008»
15 years 14 days ago
Incorporating string transformations in record matching
Today's record matching infrastructure does not allow a flexible way to account for synonyms such as "Robert" and "Bob" which refer to the same name, and ...
Arvind Arasu, Surajit Chaudhuri, Kris Ganjam, Ragh...
122
Voted
SIGMOD
2011
ACM
221views Database» more  SIGMOD 2011»
14 years 3 months ago
Scalable query rewriting: a graph-based approach
In this paper we consider the problem of answering queries using views, which is important for data integration, query optimization, and data warehouses. We consider its simplest ...
George Konstantinidis, José Luis Ambite
CLUSTER
2006
IEEE
15 years 6 months ago
Resource Management for Interactive Jobs in a Grid Environment
1 Most recent Grid middleware technologies have been aimed at the execution of sequential batch jobs. However, some users require interactive access when running jobs on Grid sites...
Enol Fernández, Elisa Heymann, Miquel A. Se...