Sciweavers

138 search results - page 2 / 28
» An Analysis of Traces from a Production MapReduce Cluster
Sort
View
SIGMOD
2011
ACM
210views Database» more  SIGMOD 2011»
12 years 8 months ago
A platform for scalable one-pass analytics using MapReduce
Today’s one-pass analytics applications tend to be data-intensive in nature and require the ability to process high volumes of data efficiently. MapReduce is a popular programm...
Boduo Li, Edward Mazur, Yanlei Diao, Andrew McGreg...
PVLDB
2010
204views more  PVLDB 2010»
13 years 3 months ago
Cheetah: A High Performance, Custom Data Warehouse on Top of MapReduce
Large-scale data analysis has become increasingly important for many enterprises. Recently, a new distributed computing paradigm, called MapReduce, and its open source implementat...
Songting Chen
PVLDB
2010
167views more  PVLDB 2010»
13 years 3 months ago
The Performance of MapReduce: An In-depth Study
MapReduce has been widely used for large-scale data analysis in the Cloud. The system is well recognized for its elastic scalability and fine-grained fault tolerance although its...
Dawei Jiang, Beng Chin Ooi, Lei Shi, Sai Wu
IPCCC
2005
IEEE
13 years 11 months ago
Cluster-based input/output trace synthesis
I/O traces are crucial for understanding the performance of new storage architectures. Unfortunately, traces are extremely bursty and difficult to characterize. They are large, d...
Bo Hong, Tara M. Madhyastha, B. Zhang
ISPASS
2005
IEEE
13 years 11 months ago
Performance Analysis of a New Packet Trace Compressor based on TCP Flow Clustering
In this paper we study the properties of a new packet trace compression method based on clustering of TCP flows. With our proposed method, the compression ratio that we achieve i...
Raimir Holanda, Javier Verdú, Jorge Garc&ia...