Sciweavers

2784 search results - page 271 / 557
» Instruction Level Parallelism
Sort
View
IPPS
2009
IEEE
15 years 10 months ago
Minimizing startup costs for performance-critical threading
—Using the well-known ATLAS and LAPACK dense linear algebra libraries, we demonstrate that the parallel management overhead (PMO) can grow with problem size on even statically sc...
Anthony M. Castaldo, R. Clint Whaley
CLUSTER
2008
IEEE
15 years 10 months ago
A multicore-enabled multirail communication engine
—The current trend in clusters architecture leads toward a massive use of multicore chips. This hardware evolution raises bottleneck issues at the network interface level. The us...
Elisabeth Brunet, François Trahay, Alexandr...
CGO
2009
IEEE
15 years 8 months ago
Alchemist: A Transparent Dependence Distance Profiling Infrastructure
Effectively migrating sequential applications to take advantage of parallelism available on multicore platforms is a well-recognized challenge. This paper addresses important aspec...
Xiangyu Zhang, Armand Navabi, Suresh Jagannathan
ECSA
2010
Springer
15 years 5 months ago
Model-based enhancement of software performance for precision critical systems
Architectural level analysis of a software system for its quality attributes is a proven cost-effective approach. This is particularly significant for performance, which defines m...
Naeem Muhammad, Nelis Boucké, Yolande Berbe...
ML
2007
ACM
134views Machine Learning» more  ML 2007»
15 years 3 months ago
Status report: the manticore project
The Manticore project is an effort to design and implement a new functional language for parallel programming. Unlike many earlier parallel languages, Manticore is a heterogeneous...
Matthew Fluet, Nic Ford, Mike Rainey, John H. Repp...