Sciweavers

FPGA
2016
ACM
72views FPGA» more  FPGA 2016»
10 years 1 months ago
CASK: Open-Source Custom Architectures for Sparse Kernels
Sparse matrix vector multiplication (SpMV) is an important kernel in many scientific applications. To improve the performance and applicability of FPGA based SpMV, we propose an ...
Paul Grigoras, Pavel Burovskiy, Wayne Luk
201
Voted
FPGA
2016
ACM
69views FPGA» more  FPGA 2016»
10 years 1 months ago
A Case for Work-stealing on FPGAs with OpenCL Atomics
We provide a case study of work-stealing, a popular method for run-time load balancing, on FPGAs. Following the Cederman–Tsigas implementation for GPUs, we synchronize workitems...
Nadesh Ramanathan, John Wickerson, Felix Winterste...
FPGA
2016
ACM
71views FPGA» more  FPGA 2016»
10 years 1 months ago
Resolve: Generation of High-Performance Sorting Architectures from High-Level Synthesis
Field Programmable Gate Array (FPGA) implementations of sorting algorithms have proven to be efficient, but existing implementations lack portability and maintainability because t...
Janarbek Matai, Dustin Richmond, Dajung Lee, Zac B...
FPGA
2016
ACM
83views FPGA» more  FPGA 2016»
10 years 1 months ago
GPU-Accelerated High-Level Synthesis for Bitwidth Optimization of FPGA Datapaths
Bitwidth optimization of FPGA datapaths can save hardware resources by choosing the fewest number of bits required for each datapath variable to achieve a desired quality of resul...
Nachiket Kapre, Deheng Ye
FOSSACS
2016
Springer
10 years 1 months ago
Synchronizing Automata over Nested Words
Abstract. We extend the concept of a synchronizing word from finitestate automata (DFA) to nested word automata (NWA): A well-matched nested word is called synchronizing if it res...
Dmitry Chistikov, Pavel Martyugin, Mahsa Shirmoham...