Sciweavers

8 search results - page 2 / 2
» Progressive optimization in a shared-nothing parallel databa...
Sort
View
87
Voted
SIGMOD
2011
ACM
210views Database» more  SIGMOD 2011»
14 years 11 days ago
A platform for scalable one-pass analytics using MapReduce
Today’s one-pass analytics applications tend to be data-intensive in nature and require the ability to process high volumes of data efficiently. MapReduce is a popular programm...
Boduo Li, Edward Mazur, Yanlei Diao, Andrew McGreg...
DEBU
2010
128views more  DEBU 2010»
14 years 7 months ago
Panda: A System for Provenance and Data
Panda (for Provenance and Data) is a new project whose goal is to develop a general-purpose system that unifies concepts from existing provenance systems and overcomes some limita...
Robert Ikeda, Jennifer Widom
SIGMOD
2011
ACM
299views Database» more  SIGMOD 2011»
14 years 11 days ago
Processing theta-joins using MapReduce
Joins are essential for many data analysis tasks, but are not supported directly by the MapReduce paradigm. While there has been progress on equi-joins, implementation of join alg...
Alper Okcan, Mirek Riedewald