FPGA 2016 | Sciweavers

61

FPGA
2016
ACM

108views FPGA» more FPGA 2016»

A Study of Pointer-Chasing Performance on Shared-Memory Processor-FPGA Systems

8 years 16 days ago

The advent of FPGA acceleration platforms with direct coherent access to processor memory creates an opportunity for accelerating applications with irregular parallelism governed ...

Gabriel Weisz, Joseph Melber, Yu Wang, Kermin Flem...

claim paper

Read More »

20

click to vote

FPGA
2016
ACM

63views FPGA» more FPGA 2016»

Automatically Optimizing the Latency, Area, and Accuracy of C Programs for High-Level Synthesis

8 years 16 days ago

Download cas.ee.ic.ac.uk

Loops are pervasive in numerical programs, so high-level synthesis (HLS) tools use state-of-the-art scheduling techniques to pipeline them eﬃciently. Still, the run time perform...

Xitong Gao, John Wickerson, George A. Constantinid...

claim paper

Read More »

22

click to vote

FPGA
2016
ACM

75views FPGA» more FPGA 2016»

FPRESSO: Enabling Express Transistor-Level Exploration of FPGA Architectures

8 years 16 days ago

Download lap.epfl.ch

In theory, tools like VTR—a retargetable toolchain mapping circuits onto easily-described hypothetical FPGA architectures—could play a key role in the development of wildly in...

Grace Zgheib, Manana Lortkipanidze, Muhsen Owaida,...

claim paper

Read More »

16

click to vote

FPGA
2016
ACM

72views FPGA» more FPGA 2016»

CASK: Open-Source Custom Architectures for Sparse Kernels

8 years 16 days ago

Download www.doc.ic.ac.uk

Sparse matrix vector multiplication (SpMV) is an important kernel in many scientiﬁc applications. To improve the performance and applicability of FPGA based SpMV, we propose an ...

Paul Grigoras, Pavel Burovskiy, Wayne Luk

claim paper

Read More »

15

click to vote

FPGA
2016
ACM

69views FPGA» more FPGA 2016»

A Case for Work-stealing on FPGAs with OpenCL Atomics

8 years 16 days ago

Download cas.ee.ic.ac.uk

We provide a case study of work-stealing, a popular method for run-time load balancing, on FPGAs. Following the Cederman–Tsigas implementation for GPUs, we synchronize workitems...

Nadesh Ramanathan, John Wickerson, Felix Winterste...

claim paper

Read More »

11

click to vote

FPGA
2016
ACM

71views FPGA» more FPGA 2016»

Resolve: Generation of High-Performance Sorting Architectures from High-Level Synthesis

8 years 16 days ago

Download kastner.ucsd.edu

Field Programmable Gate Array (FPGA) implementations of sorting algorithms have proven to be eﬃcient, but existing implementations lack portability and maintainability because t...

Janarbek Matai, Dustin Richmond, Dajung Lee, Zac B...

claim paper

Read More »

20

click to vote

FPGA
2016
ACM

83views FPGA» more FPGA 2016»

GPU-Accelerated High-Level Synthesis for Bitwidth Optimization of FPGA Datapaths

8 years 16 days ago

Download nachiket.github.io

Bitwidth optimization of FPGA datapaths can save hardware resources by choosing the fewest number of bits required for each datapath variable to achieve a desired quality of resul...

Nachiket Kapre, Deheng Ye

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers