Sciweavers

SC
2015
ACM
8 years 10 days ago
HPC I/O trace extrapolation
Today’s rapid development of supercomputers has caused I/O performance to become a major performance bottleneck for many scientific applications. Trace analysis tools have thus...
Xiaoqing Luo, Frank Mueller, Philip H. Carns, John...
SC
2015
ACM
8 years 10 days ago
Mantle: a programmable metadata load balancer for the ceph file system
Migrating resources is a useful tool for balancing load in a distributed system, but it is difficult to determine when to move resources, where to move resources, and how much of ...
Michael A. Sevilla, Noah Watkins, Carlos Maltzahn,...
SC
2015
ACM
8 years 10 days ago
PL2AP: fast parallel cosine similarity search
Solving the AllPairs similarity search problem entails finding all pairs of vectors in a high dimensional sparse dataset that have a similarity value higher than a given threshol...
David C. Anastasiu, George Karypis
SC
2015
ACM
8 years 10 days ago
Fault tolerant MapReduce-MPI for HPC clusters
Building MapReduce applications using the Message-Passing Interface (MPI) enables us to exploit the performance of large HPC clusters for big data analytics. However, due to the l...
SC
2015
ACM
8 years 10 days ago
Lessons Learned from Building In Situ Coupling Frameworks
Over the past few years, the increasing amounts of data produced by large-scale simulations have motivated a shift from traditional offline data analysis to in situ analysis and v...
Matthieu Dorier, Matthieu Dreher, Tom Peterka, Jus...
SC
2015
ACM
8 years 10 days ago
Performance and productivity of parallel python programming: a study with a CFD test case
The programming language Python is widely used to create rapidly compact software. However, compared to low-level programming languages like C or Fortran low performance is preven...
Achim Basermann, Melven Röhrig-Zöllner, ...
SC
2015
ACM
8 years 10 days ago
Comparative I/O workload characterization of two leadership class storage clusters
The Oak Ridge Leadership Computing Facility (OLCF) is a leader in large-scale parallel file system development, design, deployment and continuous operation. For the last decade, ...
Raghul Gunasekaran, Sarp Oral, Jason Hill, Ross Mi...
SC
2015
ACM
8 years 10 days ago
Node variability in large-scale power measurements: perspectives from the Green500, Top500 and EEHPCWG
The last decade has seen power consumption move from an afterthought to the foremost design constraint of new supercomputers. Measuring the power of a supercomputer can be a daunt...
SC
2015
ACM
8 years 10 days ago
Relative debugging for a highly parallel hybrid computer system
Relative debugging traces software errors by comparing two executions of a program concurrently - one code being a reference version and the other faulty. Relative debugging is pa...
SC
2015
ACM
8 years 10 days ago
High-performance many-core networking: design and implementation
Jordi Ros-Giralt, Alan Commike, Dan Honey, Richard...