Sciweavers

8 search results - page 2 / 2
» Progressive optimization in a shared-nothing parallel databa...
Sort
View
SIGMOD
2011
ACM
210views Database» more  SIGMOD 2011»
12 years 7 months ago
A platform for scalable one-pass analytics using MapReduce
Today’s one-pass analytics applications tend to be data-intensive in nature and require the ability to process high volumes of data efficiently. MapReduce is a popular programm...
Boduo Li, Edward Mazur, Yanlei Diao, Andrew McGreg...
DEBU
2010
128views more  DEBU 2010»
13 years 2 months ago
Panda: A System for Provenance and Data
Panda (for Provenance and Data) is a new project whose goal is to develop a general-purpose system that unifies concepts from existing provenance systems and overcomes some limita...
Robert Ikeda, Jennifer Widom
SIGMOD
2011
ACM
299views Database» more  SIGMOD 2011»
12 years 7 months ago
Processing theta-joins using MapReduce
Joins are essential for many data analysis tasks, but are not supported directly by the MapReduce paradigm. While there has been progress on equi-joins, implementation of join alg...
Alper Okcan, Mirek Riedewald