Sciweavers

2811 search results - page 357 / 563
» Virtue: Performance Visualization of Parallel and Distribute...
Sort
View
EUROPAR
2006
Springer
15 years 8 months ago
Optimization of Dense Matrix Multiplication on IBM Cyclops-64: Challenges and Experiences
Abstract. This paper presents a study of performance optimization of dense matrix multiplication on IBM Cyclops-64(C64) chip architecture. Although much has been published on how t...
Ziang Hu, Juan del Cuvillo, Weirong Zhu, Guang R. ...
DCOSS
2008
Springer
15 years 6 months ago
Time Synchronization in Heterogeneous Sensor Networks
Heterogeneous sensor networks consisting of resource-constrained nodes as well as resource-intensive nodes equipped with high-bandwidth sensors offer significant advantages for dev...
Isaac Amundson, Branislav Kusy, Péter V&oum...
IPPS
2009
IEEE
15 years 11 months ago
Using hardware transactional memory for data race detection
Abstract—Widespread emergence of multicore processors will spur development of parallel applications, exposing programmers to degrees of hardware concurrency hitherto unavailable...
Shantanu Gupta, Florin Sultan, Srihari Cadambi, Fr...
HPCA
2000
IEEE
15 years 8 months ago
Register Organization for Media Processing
Processor architectures with tens to hundreds of arithmetic units are emerging to handle media processing applications. These applications, such as image coding, image synthesis, ...
Scott Rixner, William J. Dally, Brucek Khailany, P...
ICPPW
2000
IEEE
15 years 8 months ago
Scheduling Multiple Multicast for Heterogeneous Network of Workstations with Non-Blocking Message-Passing
This paper proposes efficient algorithms for implementing multicast in heterogeneous workstation/PC clusters. Multicast is an important operation in many scientific and industri...
Shih-Hsien Yeh, Jan-Jan Wu, Pangfeng Liu