PPOPP 2010 | Sciweavers

18

PPOPP
2010
ACM

234views Distributed and Parallel Com...» more PPOPP 2010»

Analyzing lock contention in multithreaded applications

13 years 3 months ago

Many programs exploit shared-memory parallelism using multithreading. Threaded codes typically use locks to coordinate access to shared data. In many cases, contention for locks r...

Nathan R. Tallent, John M. Mellor-Crummey, Allan P...

claim paper

Read More »

13

click to vote

PPOPP
2010
ACM

140views Distributed and Parallel Com...» more PPOPP 2010»

Helper locks for fork-join parallel programming

13 years 6 months ago

Download people.csail.mit.edu

Helper locks allow programs with large parallel critical sections, called parallel regions, to execute more efficiently by enlisting processors that might otherwise be waiting on ...

Kunal Agrawal, Charles E. Leiserson, Jim Sukha

claim paper

Read More »

14

click to vote

PPOPP
2010
ACM

141views Distributed and Parallel Com...» more PPOPP 2010»

Compiler aided selective lock assignment for improving the performance of software transactional memory

13 years 8 months ago

Download hpc.serc.iisc.ernet.in

Sandya Mannarswamy, Dhruva R. Chakrabarti, Kaushik...

claim paper

Read More »

17

click to vote

PPOPP
2010
ACM

308views Distributed and Parallel Com...» more PPOPP 2010»

Thread to strand binding of parallel network applications in massive multi-threaded systems

13 years 11 months ago

Download capinfo.e.ac.upc.edu

In processors with several levels of hardware resource sharing, like CMPs in which each core is an SMT, the scheduling process becomes more complex than in processors with a singl...

Petar Radojkovic, Vladimir Cakarevic, Javier Verd&...

claim paper

Read More »

19

click to vote

PPOPP
2010
ACM

259views Distributed and Parallel Com...» more PPOPP 2010»

An adaptive performance modeling tool for GPU architectures

13 years 11 months ago

Download impact.crhc.illinois.edu

This paper presents an analytical model to predict the performance of general-purpose applications on a GPU architecture. The model is designed to provide performance information ...

Sara S. Baghsorkhi, Matthieu Delahaye, Sanjay J. P...

claim paper

Read More »

16

click to vote

PPOPP
2010
ACM

240views Distributed and Parallel Com...» more PPOPP 2010»

Load balancing on speed

13 years 11 months ago

Download hpcrd.lbl.gov

To fully exploit multicore processors, applications are expected to provide a large degree of thread-level parallelism. While adequate for low core counts and their typical worklo...

Steven Hofmeyr, Costin Iancu, Filip Blagojevic

claim paper

Read More »

14

click to vote

PPOPP
2010
ACM

216views Distributed and Parallel Com...» more PPOPP 2010»

Structure-driven optimizations for amorphous data-parallel programs

14 years 1 months ago

Download users.ices.utexas.edu

Irregular algorithms are organized around pointer-based data structures such as graphs and trees, and they are ubiquitous in applications. Recent work by the Galois project has pr...

Mario Méndez-Lojo, Donald Nguyen, Dimitrios...

claim paper

Read More »

15

click to vote

PPOPP
2010
ACM

202views Distributed and Parallel Com...» more PPOPP 2010»

Applying the concurrent collections programming model to asynchronous parallel dense linear algebra

14 years 1 months ago

Download vuduc.org

This poster is a case study on the application of a novel programming model, called Concurrent Collections (CnC), to the implementation of an asynchronous-parallel algorithm for c...

Aparna Chandramowlishwaran, Kathleen Knobe, Richar...

claim paper

Read More »

19

click to vote

PPOPP
2010
ACM

210views Distributed and Parallel Com...» more PPOPP 2010»

Scheduling support for transactional memory contention management

14 years 1 months ago

Download www.cs.bgu.ac.il

Transactional Memory (TM) is considered as one of the most promising paradigms for developing concurrent applications. TM has been shown to scale well on multiple cores when the d...

Walther Maldonado, Patrick Marlier, Pascal Felber,...

claim paper

Read More »

11

click to vote

PPOPP
2010
ACM

209views Distributed and Parallel Com...» more PPOPP 2010»

Model-driven autotuning of sparse matrix-vector multiply on GPUs

14 years 1 months ago

Download vuduc.org

We present a performance model-driven framework for automated performance tuning (autotuning) of sparse matrix-vector multiply (SpMV) on systems accelerated by graphics processing...

Jee W. Choi, Amik Singh, Richard W. Vuduc

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers