Search Sciweavers | Sciweavers

SPAA
2004
ACM

Effectively sharing a cache among threads

13 years 10 months ago

We compare the number of cache misses M1 for running a computation on a single processor with cache size C1 to the total number of misses Mp for the same computation when using p ...

Guy E. Blelloch, Phillip B. Gibbons

IPPS
2010
IEEE

Restructuring parallel loops to curb false sharing on multicore architectures

13 years 2 months ago

Download www.cs.txstate.edu

The memory hierarchy of most multicore systems contains one or more levels of cache that is shared among multiple cores. The shared-cache architecture presents many opportunities f...

Santosh Sarangkar, Apan Qasem

PPOPP
2010
ACM

Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs?

14 years 1 month ago

Download www.cs.wm.edu

Most modern Chip Multiprocessors (CMP) feature shared cache on chip. For multithreaded applications, the sharing reduces communication latency among co-running threads, but also r...

Eddy Z. Zhang, Xipeng Shen, Yunlian Jiang

IPPS
2010
IEEE

Exploiting inter-thread temporal locality for chip multithreading

13 years 2 months ago

Download www.cs.virginia.edu

Multi-core organizations increasingly support multiple threads per core. Threads on a core usually share a single first-level data cache, so thread schedulers must try to minimize ...

Jiayuan Meng, Jeremy W. Sheaffer, Kevin Skadron

ARCS
2006
Springer

Do Trace Cache, Value Prediction and Prefetching Improve SMT Throughput?

13 years 8 months ago

Download cobweb.ecn.purdue.edu

While trace cache, value prediction, and prefetching have been shown to be effective in the single-threaded superscalar, there has been no analysis of these techniques in a Simulta...

Chen-Yong Cher, Il Park, T. N. Vijaykumar
