Sciweavers

CLUSTER
2008
IEEE
Redistribution aware two-step scheduling for mixed-parallel applications
Applications arising in many scientific fields exhibit both data and task parallelism that have to be exploited efficiently. A classic approach is to structure those applica...
Sascha Hunold, Thomas Rauber, Frédér...
CLUSTER
2008
IEEE
A multicore-enabled multirail communication engine
The current trend in cluster architecture leads toward a massive use of multicore chips. This hardware evolution raises bottleneck issues at the network interface level. The us...
Elisabeth Brunet, François Trahay, Alexandr...
CLUSTER
2008
IEEE
Supporting storage resources in Urgent Computing Environments
The Special PRiority and Urgent Computing Environment (SPRUCE) provides on-demand access to high-performance computing resources for time-critical applications. While SPR...
Jason Cope, Henry M. Tufo
CLUSTER
2008
IEEE
Intelligent compilers
The industry is now in agreement that the future of architecture design lies in multiple cores. As a consequence, all computer systems today, from embedded devices to petascale ...
John Cavazos
CLUSTER
2008
IEEE
An OSD-based approach to managing directory operations in parallel file systems
Distributed file systems that use multiple servers to store data in parallel are becoming commonplace. Much work has already gone into such systems to maximize data throughput....
Nawab Ali, Ananth Devulapalli, Dennis Dalessandro,...
CLUSTER
2008
IEEE
A comparison of search heuristics for empirical code optimization
This paper describes the application of various search techniques to the problem of automatic empirical code optimization. The search process is a critical aspect of auto-tuning...
Keith Seymour, Haihang You, Jack Dongarra
CLUSTER
2008
IEEE
Context-aware address translation for high performance SMP cluster system
User-level communication allows an application process to access the network interface directly. Bypassing the kernel requires that a user process accesses the network interface...
Moon-Sang Lee, Joonwon Lee, Seungryoul Maeng
CLUSTER
2008
IEEE
OpenMP-centric performance analysis of hybrid applications
Several performance analysis tools support hybrid applications. Most originated as MPI profiling or tracing tools, and OpenMP capabilities were added to extend the performance a...
Karl Fürlinger, Shirley Moore
CLUSTER
2008
IEEE
In search of sweet-spots in parallel performance monitoring
Parallel performance monitoring extends parallel measurement systems with infrastructure and interfaces for online performance data access, communication, and analysis. At the s...
Aroon Nataraj, Allen D. Malony, Allen Morris, Dori...