Sciweavers

ISHPC
2000
Springer
Loop Termination Prediction
Deeply pipelined high-performance processors require highly accurate branch prediction to drive their instruction fetch. However, there remains a class of events which are not easi...
Timothy Sherwood, Brad Calder
ISHPC
2000
Springer
Implementation and Evaluation of OpenMP for Hitachi SR8000
This paper describes the implementation and evaluation of the OpenMP compiler designed for the Hitachi SR8000 Super Technical Server. The compiler performs parallelization for the ...
Yasunori Nishitani, Kiyoshi Negishi, Hiroshi Ohta,...
ISHPC
2000
Springer
Leveraging Transparent Data Distribution in OpenMP via User-Level Dynamic Page Migration
This paper describes transparent mechanisms for emulating some of the data distribution facilities offered by traditional data-parallel programming models, such as High Performance...
Dimitrios S. Nikolopoulos, Theodore S. Papatheodor...
ISHPC
2000
Springer
Partitioned Parallel Radix Sort
Partitioned parallel radix sort is a parallel radix sort that shortens the execution time by modifying the load-balanced radix sort, which is known as one of the fastest internal sort...
Shin-Jae Lee, Minsoo Jeon, Andrew Sohn, Dongseung ...
ISHPC
2000
Springer
The New DRAM Interfaces: SDRAM, RDRAM and Variants
For the past two decades, developments in DRAM technology, the primary technology for the main memory of computers, have been directed towards increasing density. As a result 256 M...
Brian Davis, Bruce L. Jacob, Trevor N. Mudge
ISHPC
2000
Springer
Limits of Task-Based Parallelism in Irregular Applications
Traditional parallel compilers do not effectively parallelize irregular applications because these applications contain little loop-level parallelism due to ambiguous memory references. We explo...
Barbara Kreaseck, Dean M. Tullsen, Brad Calder
ISHPC
2000
Springer
Automatic Generation of OpenMP Directives and Its Application to Computational Fluid Dynamics Codes
The shared-memory programming model is a very effective way to achieve parallelism on shared-memory parallel computers. As great progress was made in hardware and software technolo...
Haoqiang Jin, Michael A. Frumkin, Jerry C. Yan
ISHPC
2000
Springer
Compiler-Directed Cache Assist Adaptivity
Xiaomei Ji, Dan Nicolaescu, Alexander V. Veidenbau...
ISHPC
2000
Springer
Fast Isosurface Generation Using the Cell-Edge Centered Propagation Algorithm
Isosurface generation algorithms usually need a vertex-identification process, since most polygon vertices of an isosurface are shared by several polygons. In our observation the...
Takayuki Itoh, Yasushi Yamaguchi, Koji Koyamada