Search Sciweavers | Sciweavers

16 search results - page 3 / 4

» Automatic code generation for executing tiled nested loops o...

ICS
2009
Tsinghua U.

Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs

Iterative stencil loops (ISLs) are used in many applications and tiling is a well-known technique to localize their computation. When ISLs are tiled across a parallel architecture...

Jiayuan Meng, Kevin Skadron

IEEEPACT
2009
IEEE

Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling

Analytical models have been used to estimate optimal values for parameters such as tile sizes in the context of loop nests. However, important algorithms such as fast Fourier tr...

Basilio B. Fraguela, Yevgen Voronenko, Markus P&uu...

WCRE
2003
IEEE

Extracting an Explicitly Data-Parallel Representation of Image-Processing Programs

Our research goal is to retarget image processing programs written in sequential languages (e.g., C) to architectures with data-parallel processing capabilities. Image processing ...

Lewis B. Baumstark Jr., Murat Guler, Linda M. Will...

CASES
2009
ACM

CGRA express: accelerating execution using dynamic operation fusion

Coarse-grained reconfigurable architectures (CGRAs) present an appealing hardware platform by providing programmability with the potential for high computation throughput, scalab...

Yongjun Park, Hyunchul Park, Scott A. Mahlke

CGO
2008
IEEE

Parallel-stage decoupled software pipelining

In recent years, the microprocessor industry has embraced chip multiprocessors (CMPs), also known as multi-core architectures, as the dominant design paradigm. For existing and ne...

Easwaran Raman, Guilherme Ottoni, Arun Raman, Matt...
