Sciweavers

33 search results - page 3 / 7
» Large-scale distributed storage for highly concurrent Mapred...
Sort
View
ICDM
2010
IEEE
189views Data Mining» more  ICDM 2010»
13 years 3 months ago
S4: Distributed Stream Computing Platform
Abstract--S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop applications for processing continu...
Leonardo Neumeyer, Bruce Robbins, Anish Nair, Anan...
CLOUDCOM
2010
Springer
13 years 3 months ago
Using Global Behavior Modeling to Improve QoS in Cloud Data Storage Services
Abstract--The cloud computing model aims to make largescale data-intensive computing affordable even for users with limited financial resources, that cannot invest into expensive i...
Jesús Montes, Bogdan Nicolae, Gabriel Anton...
HPDC
2010
IEEE
13 years 6 months ago
ROARS: a scalable repository for data intensive scientific computing
As scientific research becomes more data intensive, there is an increasing need for scalable, reliable, and high performance storage systems. Such data repositories must provide b...
Hoang Bui, Peter Bui, Patrick J. Flynn, Douglas Th...
CCGRID
2009
IEEE
13 years 3 months ago
Block-Based Concurrent and Storage-Aware Data Streaming for Grid Applications with Lots of Small Files
Data streaming management and scheduling is required by many grid computing applications, especially when the volume of data to be processed is extremely high while available stor...
Wen Zhang, Junwei Cao, Yisheng Zhong, Lianchen Liu...
OSDI
2008
ACM
13 years 8 months ago
DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language
DryadLINQ is a system and a set of language extensions that enable a new programming model for large scale distributed computing. It generalizes previous execution environments su...
Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Bud...