parallel computing | Sciweavers

26

PPOPP
2010
ACM

156views Distributed and Parallel Com...» more PPOPP 2010»

Modeling advanced collective communication algorithms on cell-based systems

14 years 6 months ago

This paper presents and validates performance models for a variety of high-performance collective communication algorithms for systems with Cell processors. The systems modeled in...

Qasim Ali, Samuel P. Midkiff, Vijay S. Pai

claim paper

Read More »

25

click to vote

PPOPP
2010
ACM

223views Distributed and Parallel Com...» more PPOPP 2010»

Application heartbeats for software performance and health

14 years 6 months ago

Download groups.csail.mit.edu

Adaptive, or self-aware, computing has been proposed to help application programmers confront the growing complexity of multicore software development. However, existing approache...

Henry Hoffmann, Jonathan Eastep, Marco D. Santambr...

claim paper

Read More »

22

click to vote

PPOPP
2010
ACM

179views Distributed and Parallel Com...» more PPOPP 2010»

Modeling transactional memory workload performance

14 years 6 months ago

Download www.cs.utexas.edu

Transactional memory promises to make parallel programming easier than with fine-grained locking, while performing just as well. This performance claim is not always borne out bec...

Donald E. Porter, Emmett Witchel

claim paper

Read More »

35

click to vote

PPOPP
2010
ACM

202views Distributed and Parallel Com...» more PPOPP 2010»

Lazy binary-splitting: a run-time adaptive work-stealing scheduler

14 years 6 months ago

Download www.umiacs.umd.edu

We present Lazy Binary Splitting (LBS), a user-level scheduler of nested parallelism for shared-memory multiprocessors that builds on existing Eager Binary Splitting work-stealing...

Alexandros Tzannes, George C. Caragea, Rajeev Baru...

claim paper

Read More »

27

click to vote

PPOPP
2010
ACM

191views Distributed and Parallel Com...» more PPOPP 2010»

Scalable communication protocols for dynamic sparse data exchange

14 years 6 months ago

Download www.unixer.de

Many large-scale parallel programs follow a bulk synchronous parallel (BSP) structure with distinct computation and communication phases. Although the communication phase in such ...

Torsten Hoefler, Christian Siebert, Andrew Lumsdai...

claim paper

Read More »

40

click to vote

PPOPP
2010
ACM

203views Distributed and Parallel Com...» more PPOPP 2010»

GAMBIT: effective unit testing for concurrency libraries

14 years 6 months ago

Download www.cs.utexas.edu

As concurrent programming becomes prevalent, software providers are investing in concurrency libraries to improve programmer productivity. Concurrency libraries improve productivi...

Katherine E. Coons, Sebastian Burckhardt, Madanlal...

claim paper

Read More »

21

click to vote

PPOPP
2010
ACM

171views Distributed and Parallel Com...» more PPOPP 2010»

Debugging programs that use atomic blocks and transactional memory

14 years 6 months ago

Download research.microsoft.com

Ferad Zyulkyarov, Tim Harris, Osman S. Unsal, Adri...

claim paper

Read More »

22

click to vote

PPOPP
2010
ACM

194views Distributed and Parallel Com...» more PPOPP 2010»

NOrec: streamlining STM by abolishing ownership records

14 years 6 months ago

Download www.cs.rochester.edu

Drawing inspiration from several previous projects, we present an ownership-record-free software transactional memory (STM) system that combines extremely low overhead with unusua...

Luke Dalessandro, Michael F. Spear, Michael L. Sco...

claim paper

Read More »

41

click to vote

PPOPP
2010
ACM

353views Distributed and Parallel Com...» more PPOPP 2010»

Data transformations enabling loop vectorization on multithreaded data parallel architectures

14 years 6 months ago

Download www.ece.neu.edu

Loop vectorization, a key feature exploited to obtain high performance on Single Instruction Multiple Data (SIMD) vector architectures, is significantly hindered by irregular memo...

Byunghyun Jang, Perhaad Mistry, Dana Schaa, Rodrig...

claim paper

Read More »

26

click to vote

PPOPP
2010
ACM

199views Distributed and Parallel Com...» more PPOPP 2010»

Symbolic prefetching in transactional distributed shared memory

14 years 6 months ago

Download demsky.eecs.uci.edu

We present a static analysis for the automatic generation of symbolic prefetches in a transactional distributed shared memory. A symbolic prefetch specifies the first object to be...

Alokika Dash, Brian Demsky

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers