Sciweavers

» Optimizing for parallelism and data locality
SC
2005
ACM
15 years 3 months ago
Integrated Loop Optimizations for Data Locality Enhancement of Tensor Contraction Expressions
A very challenging issue for optimizing compilers is the phase ordering problem: In what order should a collection of compiler optimizations be performed? We address this problem ...
Swarup Kumar Sahoo, Sriram Krishnamoorthy, Rajkira...
USENIX
1996
14 years 11 months ago
Transparent Fault Tolerance for Parallel Applications on Networks of Workstations
This paper describes a new method for providing transparent fault tolerance for parallel applications on a network of workstations. We have designed our method in the context of sh...
Daniel J. Scales, Monica S. Lam
ICPR
2008
IEEE
15 years 11 months ago
3D model based vehicle localization by optimizing local gradient based fitness evaluation
We address the problem of 3D model based vehicle localization in calibrated traffic scenes. A wireframe vehicle model is set up as prior information and an efficient local gradien...
Kaiqi Huang, Min Li, Tieniu Tan, Zhaoxiang Zhang
IPPS
2003
IEEE
15 years 3 months ago
ECO: An Empirical-Based Compilation and Optimization System
In this paper, we describe a compilation system that automates much of the process of performance tuning that is currently done manually by application programmers interested in h...
Nastaran Baradaran, Jacqueline Chame, Chun Chen, P...
IEEEPACT
2008
IEEE
15 years 4 months ago
Improving support for locality and fine-grain sharing in chip multiprocessors
Both commercial and scientific workloads benefit from concurrency and exhibit data sharing across threads/processes. The resulting sharing patterns are often fine-grain, with t...
Hemayet Hossain, Sandhya Dwarkadas, Michael C. Hua...