Sciweavers

3379 search results - page 354 / 676
» Parallel cross-entropy optimization
Sort
View
111
Voted
ICCAD
2005
IEEE
131views Hardware» more  ICCAD 2005»
16 years 14 days ago
Code restructuring for improving cache performance of MPSoCs
— One of the critical goals in code optimization for MPSoC architectures is to minimize the number of off-chip memory accesses. This is because such accesses can be extremely cos...
Guilin Chen, Mahmut T. Kandemir
129
Voted
ICDCS
2009
IEEE
15 years 10 months ago
Explicit Batching for Distributed Objects
Although distributed object systems, for example RMI and CORBA, enable object-oriented programs to be easily distributed across a network, achieving acceptable performance usually...
Eli Tilevich, William R. Cook, Yang Jiao
127
Voted
ICS
2009
Tsinghua U.
15 years 10 months ago
Computer generation of fast fourier transforms for the cell broadband engine
The Cell BE is a multicore processor with eight vector accelerators (called SPEs) that implement explicit cache management through direct memory access engines. While the Cell has...
Srinivas Chellappa, Franz Franchetti, Markus P&uum...
130
Voted
IPPS
2009
IEEE
15 years 10 months ago
Sparse collective operations for MPI
Abstract—We discuss issues in designing sparse (nearest neighbor) collective operations for communication and reduction operations in small neighborhoods for the Message Passing ...
Torsten Hoefler, Jesper Larsson Träff
136
Voted
MIDDLEWARE
2009
Springer
15 years 10 months ago
Rhizoma: A Runtime for Self-deploying, Self-managing Overlays
Abstract. The trend towards cloud and utility computing infrastructures raises challenges not only for application development, but also for management: diverse resources, changing...
Qin Yin, Adrian Schüpbach, Justin Cappos, And...