Sciweavers

403 search results - page 11 / 81
» Data Partitioning for Minimizing Transferred Data in MapRedu...
EUROPAR
2007
Springer
15 years 10 months ago
Are P2P Data-Dissemination Techniques Viable in Today's Data-Intensive Scientific Collaborations?
The interest among a geographically distributed user base to mine massive collections of scientific data propels the need for efficient data dissemination solutions. An optimal dat...
Samer Al-Kiswany, Matei Ripeanu, Adriana Iamnitchi...
CLOUD
2010
ACM
15 years 8 months ago
Making cloud intermediate data fault-tolerant
Parallel dataflow programs generate enormous amounts of distributed data that are short-lived, yet are critical for completion of the job and for good run-time performance. We ca...
Steven Y. Ko, Imranul Hoque, Brian Cho, Indranil G...
ISCAPDCS
2003
15 years 5 months ago
Heterogeneous Hardware-Software System Partitioning using Extended Directed Acyclic Graph
In this paper, we present a system partitioning technique in which the input system specification is based on C++ language. The proposed technique processes data and precedence de...
Matthew Jin, Gul N. Khan
EUROSYS
2011
ACM
14 years 7 months ago
Scarlett: coping with skewed content popularity in MapReduce clusters
To improve data availability and resilience, MapReduce frameworks use file systems that replicate data uniformly. However, analysis of job logs from a large production cluster show...
Ganesh Ananthanarayanan, Sameer Agarwal, Srikanth ...
DAWAK
2005
Springer
15 years 9 months ago
Nearest Neighbor Search on Vertically Partitioned High-Dimensional Data
In this paper, we present a new approach to indexing multidimensional data that is particularly suitable for the efficient incremental processing of nearest neighbor quer...
Evangelos Dellis, Bernhard Seeger, Akrivi Vlachou