Sciweavers

581 search results - page 54 / 117
» Implementing the Best Processor Cores
Sort
View
114
Voted
HPCA
2009
IEEE
16 years 2 months ago
Adaptive Spill-Receive for robust high-performance caching in CMPs
In a Chip Multi-Processor (CMP) with private caches, the last level cache is statically partitioned between all the cores. This prevents such CMPs from sharing cache capacity in r...
Moinuddin K. Qureshi
113
Voted
ISPASS
2009
IEEE
15 years 8 months ago
Accuracy of performance counter measurements
Many workload characterization studies depend on accurate measurements of the cost of executing a piece of code. Often these measurements are conducted using infrastructures to ac...
Dmitrijs Zaparanuks, Milan Jovic, Matthias Hauswir...
113
Voted
ISVC
2009
Springer
15 years 8 months ago
Parallel Poisson Surface Reconstruction
In this work we describe a parallel implementation of the Poisson Surface Reconstruction algorithm based on multigrid domain decomposition. We compare implementations using differ...
Matthew Bolitho, Michael M. Kazhdan, Randal C. Bur...
ASPLOS
2011
ACM
14 years 5 months ago
Hardware acceleration of transactional memory on commodity systems
The adoption of transactional memory is hindered by the high overhead of software transactional memory and the intrusive design changes required by previously proposed TM hardware...
Jared Casper, Tayo Oguntebi, Sungpack Hong, Nathan...
EUROPAR
2007
Springer
15 years 8 months ago
MCSTL: The Multi-core Standard Template Library
1 Future gain in computing performance will not stem from increased clock rates, but from even more cores in a processor. Since automatic parallelization is still limited to easily...
Johannes Singler, Peter Sanders, Felix Putze