Sciweavers

» Dynamic Load Distribution in the Borealis Stream Processor
HPCA
2006
IEEE
16 years 1 day ago
Software-hardware cooperative memory disambiguation
In high-end processors, increasing the number of in-flight instructions can improve performance by overlapping useful processing with long-latency accesses to the main memory. Buf...
Ruke Huang, Alok Garg, Michael C. Huang
CCGRID
2004
IEEE
15 years 3 months ago
High performance LU factorization for non-dedicated clusters
This paper describes an implementation of parallel LU factorization. The focus is to achieve high performance on non-dedicated clusters, where the number of available computing re...
Toshio Endo, Kenji Kaneda, Kenjiro Taura, Akinori ...
HOTOS
1997
IEEE
15 years 3 months ago
Run-Time Code Generation as a Central System Service
We are building an operating system in which an integral run-time code generator constantly strives to improve the quality of already executing code. Our system is based on a plat...
Michael Franz
ICPPW
2006
IEEE
15 years 5 months ago
Multiple Flows of Control in Migratable Parallel Programs
Many important parallel applications require multiple flows of control to run on a single processor. In this paper, we present a study of four flow-of-control mechanisms: proces...
Gengbin Zheng, Laxmikant V. Kalé, Orion Sky...
IPPS
2006
IEEE
15 years 5 months ago
Simulation of a hybrid model for image denoising
We propose a new model for image denoising which is a hybrid of the total variation model and the Laplacian mean-curvature model. An efficient numerical procedure to compute the h...
Ricolindo Cariño, Ioana Banicescu, H. Lim, ...