Search Sciweavers | Sciweavers

14 search results - page 2 / 3

» Tradeoff between data-, instruction-, and thread-level paral...

click to vote

ISPASS
2009
IEEE

240views Software Engineering» more ISPASS 2009»

Analyzing CUDA workloads using a detailed GPU simulator

14 years 4 days ago

Download www.ece.ubc.ca

Modern Graphic Processing Units (GPUs) provide sufﬁciently ﬂexible programming models that understanding their performance can provide insight in designing tomorrow’s manyco...

Ali Bakhoda, George L. Yuan, Wilson W. L. Fung, He...

claim paper

Read More »

click to vote

MICRO
2003
IEEE

148views Hardware» more MICRO 2003»

Fast Secure Processor for Inhibiting Software Piracy and Tampering

13 years 10 months ago

Download www.microarch.org

Due to the widespread software piracy and virus attacks, signiﬁcant efforts have been made to improve security for computer systems. For stand-alone computers, a key observation...

Jun Yang 0002, Youtao Zhang, Lan Gao

claim paper

Read More »

click to vote

LCTRTS
2005
Springer

160views System Software» more LCTRTS 2005»

Cache aware optimization of stream programs

13 years 10 months ago

Download groups.csail.mit.edu

Effective use of the memory hierarchy is critical for achieving high performance on embedded systems. We focus on the class of streaming applications, which is increasingly preval...

Janis Sermulins, William Thies, Rodric M. Rabbah, ...

claim paper

Read More »

click to vote

SPDP
1991
IEEE

130views Distributed And Parallel Com...» more SPDP 1991»

Local vs. global memory in the IBM RP3: experiments and performance modelling

13 years 8 months ago

Download web.it.kth.se

A number of experiments regarding the placement of instructions, private data and shared data in the Non-Uniform-Memory-Access multiprocessor, RP3 has been performed. Three Scient...

Mats Brorsson

claim paper

Read More »

click to vote

ISCA
1994
IEEE

129views Hardware» more ISCA 1994»

Impact of Sharing-Based Thread Placement on Multithreaded Architectures

13 years 9 months ago

Download www.cs.sfu.ca

Multithreaded architectures context switch between instruction streams to hide memory access latency. Although this improves processor utilization, it can increase cache interfere...

Radhika Thekkath, Susan J. Eggers

claim paper

Read More »

« Prev « First page 2 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers