Sciweavers

2609 search results - page 175 / 522
» Optimizing for parallelism and data locality
CPHYSICS
2007
15 years 1 month ago
UPIC: A framework for massively parallel particle-in-cell codes
The UCLA Parallel Particle-in-Cell (UPIC) Framework is designed to provide trusted components for building a variety of parallel Particle-in-Cell (PIC) codes. It is based on the ...
Viktor K. Decyk
IPPS
2010
IEEE
14 years 11 months ago
Highly scalable parallel sorting
Sorting is a commonly used process with a wide breadth of applications in the high performance computing field. Early research in parallel processing has provided us with comprehen...
Edgar Solomonik, Laxmikant V. Kalé
PLDI
2009
ACM
15 years 8 months ago
Parallelizing sequential applications on commodity hardware using a low-cost software transactional memory
Multicore designs have emerged as the mainstream design paradigm for the microprocessor industry. Unfortunately, providing multiple cores does not directly translate into performa...
Mojtaba Mehrara, Jeff Hao, Po-Chun Hsu, Scott A. M...
HPDC
2006
IEEE
15 years 7 months ago
Optimal Bandwidth Sharing in Grid Environments
We consider the problem of bulk data transfers and bandwidth sharing in the context of grid infrastructures. Grid computing empowers high-performance computing in a large-scale di...
Loris Marchal, Pascale Vicat-Blanc Primet, Yves Ro...
EUROPAR
2001
Springer
15 years 5 months ago
Performance of High-Accuracy PDE Solvers on a Self-Optimizing NUMA Architecture
High-accuracy PDE solvers use multi-dimensional fast Fourier transforms. The FFTs exhibit a static and structured memory access pattern, which results in a large amount of communic...
Sverker Holmgren, Dan Wallin