Sciweavers

110 search results - page 18 / 22
» mp 2007
Sort
View
EUROPAR
2007
Springer
15 years 1 months ago
On Using Incremental Profiling for the Performance Analysis of Shared Memory Parallel Applications
Abstract. Profiling is often the method of choice for performance analysis of parallel applications due to its low overhead and easily comprehensible results. However, a disadvanta...
Karl Fürlinger, Michael Gerndt, Jack Dongarra
DSD
2007
IEEE
142views Hardware» more  DSD 2007»
15 years 3 months ago
Decoupling of Computation and Communication with a Communication Assist
Abstract. In an embedded multiprocessor system the minimum throughput and maximum latency of real-time applications are usually derived given the worst-case execution time of the s...
Arno Moonen, Marco Bekooij, Rene van den Berg, Jef...
FCCM
2007
IEEE
117views VLSI» more  FCCM 2007»
15 years 3 months ago
FPGA Acceleration of Gene Rearrangement Analysis
In this paper we present our work toward FPGA acceleration of phylogenetic reconstruction, a type of analysis that is commonly performed in the fields of systematic biology and co...
Jason D. Bakos
IEEEPACT
2007
IEEE
15 years 3 months ago
AA-Sort: A New Parallel Sorting Algorithm for Multi-Core SIMD Processors
Many sorting algorithms have been studied in the past, but there are only a few algorithms that can effectively exploit both SIMD instructions and threadlevel parallelism. In this...
Hiroshi Inoue, Takao Moriyama, Hideaki Komatsu, To...
ISSAC
2007
Springer
163views Mathematics» more  ISSAC 2007»
15 years 3 months ago
POSIX threads polynomials(PTPol): a scalable implementation of univariate arithmetic operations
In this paper, we describe the design of a C library named PTPol implementing arithmetic operations for univariate polynomials and report on practical experiments showing the rele...
Mohab Safey El Din, Philippe Trebuchet