Sciweavers

HOTOS
2009
IEEE

On Availability of Intermediate Data in Cloud Computations

13 years 8 months ago
On Availability of Intermediate Data in Cloud Computations
This paper takes a renewed look at the problem of managing intermediate data that is generated during dataflow computations (e.g., MapReduce, Pig, Dryad, etc.) within clouds. We discuss salient features of this intermediate data and outline requirements for a solution. Our experiments show that existing local writeremote read solutions, traditional distributed file systems (e.g., HDFS), and support from transport protocols (e.g., TCP-Nice) cannot guarantee both data availability and minimal interference, which are our key requirements. We present design ideas for a new intermediate data storage system.
Steven Y. Ko, Imranul Hoque, Brian Cho, Indranil G
Added 16 Aug 2010
Updated 16 Aug 2010
Type Conference
Year 2009
Where HOTOS
Authors Steven Y. Ko, Imranul Hoque, Brian Cho, Indranil Gupta
Comments (0)