Sciweavers

IPPS
2010
IEEE
13 years 2 months ago
Oversubscription on multicore processors
Abstract: Existing multicore systems already provide deep levels of thread parallelism. Hybrid programming models and composability of parallel libraries are very active areas of r...
Costin Iancu, Steven Hofmeyr, Filip Blagojevic, Yi...
JPDC
2006
106views more  JPDC 2006»
13 years 4 months ago
Performance characteristics of the multi-zone NAS parallel benchmarks
We describe a new suite of computational benchmarks that models applications featuring multiple levels of parallelism. Such parallelism is often available in realistic flow comput...
Haoqiang Jin, Rob F. Van der Wijngaart
PPSC
1997
13 years 6 months ago
New Implementations and Results for the NAS Parallel Benchmarks 2
We present new implementations and results for the NAS Parallel Benchmarks 2 suite. The suite currently consists of seven programs. Of these LU, SP, BT, MG and FT have previously ...
William Saphir, Rob F. Van der Wijngaart, Alex Woo...
SC
1991
ACM
13 years 8 months ago
Performance results for two of the NAS parallel benchmarks
Two problems from the recently published “NAS Parallel Benchmarks” have been implemented on three advanced parallel computer systems. These two benchmarks are the following: (...
David H. Bailey, Paul O. Frederickson
EUROPAR
2004
Springer
13 years 8 months ago
Overhead Compensation in Performance Profiling
Measurement-based profiling introduces intrusion in program execution. Intrusion effects can be mitigated by compensating for measurement overhead. Techniques for compensation anal...
Allen D. Malony, Sameer Shende
SC
1992
ACM
13 years 9 months ago
NAS Parallel Benchmark Results
The NAS Parallel Benchmarks have been developed at NASA Ames Research Center to study the performance of parallel supercomputers. The eight benchmark problems are specified in a &...
David H. Bailey, Leonardo Dagum, E. Barszcz, Horst...
IWCC
1999
IEEE
13 years 9 months ago
Comparative Performance of a Commodity Alpha Cluster Running Linux and Windows NT
Using a cluster of commodity Alpha processors we compare two software platforms based on Linux and Windows NT and intended to support intensive scientic computations. Networking a...
David Lancaster, Kenji Takeda
IPPS
2002
IEEE
13 years 9 months ago
Effective Cross-Platform, Multilevel Parallelism via Dynamic Adaptive Execution
This paper presents preliminary efforts to develop compilation and execution environments that achieve performance portability of multilevel parallelization on hierarchical archit...
Walden Ko, Mark N. Yankelevsky, Dimitrios S. Nikol...
LCPC
2005
Springer
13 years 10 months ago
Titanium Performance and Potential: An NPB Experimental Study
Titanium is an explicitly parallel dialect of JavaTM designed for high-performance scientific programming. It offers objectorientation, strong typing, and safe memory management...
Kaushik Datta, Dan Bonachea, Katherine A. Yelick
SC
2005
ACM
13 years 10 months ago
An Application-Based Performance Characterization of the Columbia Supercluster
Columbia is a 10,240-processor supercluster consisting of 20 Altix nodes with 512 processors each, and currently ranked as one of the fastest computers in the world. In this paper...
Rupak Biswas, M. Jahed Djomehri, Robert Hood, Haoq...