Sciweavers

778 search results - page 99 / 156
» Efficient Code Generation for Automatic Parallelization and ...
Sort
View
ASPLOS
2011
ACM
14 years 1 months ago
Sponge: portable stream programming on graphics engines
Graphics processing units (GPUs) provide a low cost platform for accelerating high performance computations. The introduction of new programming languages, such as CUDA and OpenCL...
Amir Hormati, Mehrzad Samadi, Mark Woh, Trevor N. ...
SANSKRIT
2010
14 years 7 months ago
Sanskrit Compound Processor
Sanskrit is very rich in compound formation. Typically a compound does not code the relation between its components explicitly. To understand the meaning of a compound, it is neces...
Anil Kumar, Vipul Mittal, Amba Kulkarni
IPPS
2006
IEEE
15 years 3 months ago
The monitoring request interface (MRI)
In this paper we present MRI, a high level interface for selective monitoring of code regions and data structures in single and multiprocessor environments. MRI keeps transparent ...
Edmond Kereku, Michael Gerndt
CASES
2008
ACM
14 years 11 months ago
Optimus: efficient realization of streaming applications on FPGAs
In this paper, we introduce Optimus: an optimizing synthesis compiler for streaming applications. Optimus compiles programs written in a high level streaming language to either so...
Amir Hormati, Manjunath Kudlur, Scott A. Mahlke, D...
ICA
2010
Springer
14 years 10 months ago
Binary Sparse Coding
We study a sparse coding learning algorithm that allows for a simultaneous learning of the data sparseness and the basis functions. The algorithm is derived based on a generative m...
Marc Henniges, Gervasio Puertas, Jörg Bornsch...