Sciweavers

175 search results - page 27 / 35
» Dynamic Load Distribution in the Borealis Stream Processor
Sort
View
HPCA
2006
IEEE
16 years 5 months ago
Software-hardware cooperative memory disambiguation
In high-end processors, increasing the number of in-flight instructions can improve performance by overlapping useful processing with long-latency accesses to the main memory. Buf...
Ruke Huang, Alok Garg, Michael C. Huang
131
Voted
CCGRID
2004
IEEE
15 years 9 months ago
High performance LU factorization for non-dedicated clusters
This paper describes an implementation of parallel LU factorization. The focus is to achieve high performance on non-dedicated clusters, where the number of available computing re...
Toshio Endo, Kenji Kaneda, Kenjiro Taura, Akinori ...
HOTOS
1997
IEEE
15 years 9 months ago
Run-Time Code Generation as a Central System Service
We are building an operating system in which an integral run-time code generator constantly strives to improve the quality of already executing code. Our system is based on a plat...
Michael Franz
152
Voted
ICPPW
2006
IEEE
15 years 11 months ago
Multiple Flows of Control in Migratable Parallel Programs
Many important parallel applications require multiple flows of control to run on a single processor. In this paper, we present a study of four flow-of-control mechanisms: proces...
Gengbin Zheng, Laxmikant V. Kalé, Orion Sky...
IPPS
2006
IEEE
15 years 11 months ago
Simulation of a hybrid model for image denoising
We propose a new model for image denoising which is a hybrid of the total variation model and the Laplacian mean-curvature model. An efficient numerical procedure to compute the h...
Ricolindo Cariño, Ioana Banicescu, H. Lim, ...