An optimization framework for map-reduce queries
PVLDB
2010
13 years 2 months ago
MRShare: Sharing Across Multiple Queries in MapReduce
Large-scale data analysis lies in the core of modern enterprises and scientific research. With the emergence of cloud computing, the use of an analytical query processing infrast...
Tomasz Nykiel, Michalis Potamias, Chaitanya Mishra...
CORR
2011
Springer
12 years 11 months ago
Automatic Optimization for MapReduce Programs
The MapReduce distributed programming framework has become popular, despite evidence that current implementations are inefficient, requiring far more hardware than a traditional r...
Eaman Jahani, Michael J. Cafarella, Christopher R...
SIGMOD
2011
ACM
12 years 7 months ago
Llama: leveraging columnar storage for scalable join processing in the MapReduce framework
To achieve high reliability and scalability, most large-scale data warehouse systems have adopted the cluster-based architecture. In this paper, we propose the design of a new clu...
Yuting Lin, Divyakant Agrawal, Chun Chen, Beng Chi...
SIGMOD
2010
ACM
13 years 9 months ago
Online aggregation and continuous query support in MapReduce
MapReduce is a popular framework for data-intensive distributed computing of batch jobs. To simplify fault tolerance, the output of each MapReduce task and job is materialized to ...
Tyson Condie, Neil Conway, Peter Alvaro, Joseph M....
MSR
2009
ACM
13 years 9 months ago
MapReduce as a general framework to support research in Mining Software Repositories (MSR)
Researchers continue to demonstrate the benefits of Mining Software Repositories (MSR) for supporting software development and research activities. However, as the mining process...
Weiyi Shang, Zhen Ming Jiang, Bram Adams, Ahmed E....