Sciweavers

CCGRID
2009
IEEE
15 years 9 months ago
Natively Supporting True One-Sided Communication in
As high-end computing systems continue to grow in scale, the performance that applications can achieve on such large scale systems depends heavily on their ability to avoid explic...
Gopalakrishnan Santhanaraman, Pavan Balaji, K. Gop...
WSC
2008
15 years 4 months ago
Improving performance by replicating simulations with alternative synchronization approaches
Parallel and distributed simulation facilitates the construction of a simulation application (i.e., federation in HLA terminology) with a number of simulation components (federate...
Zengxiang Li, Wentong Cai, Stephen John Turner, Ke...
SPAA
2010
ACM
15 years 7 months ago
Implementing and evaluating nested parallel transactions in software transactional memory
Transactional Memory (TM) is a promising technique that simplifies parallel programming for shared-memory applications. To date, most TM systems have been designed to efficientl...
Woongki Baek, Nathan Grasso Bronson, Christos Kozy...
JPDC
2008
15 years 2 months ago
A performance study of general-purpose applications on graphics processors using CUDA
Graphics processors (GPUs) provide a vast number of simple, data-parallel, deeply multithreaded cores and high memory bandwidths. GPU architectures are becoming increasingly progr...
Shuai Che, Michael Boyer, Jiayuan Meng, David Tarj...
TPDS
2010
15 years 18 days ago
Self-Consistent MPI Performance Guidelines
Message passing using the Message Passing Interface (MPI) is at present the most widely adopted framework for programming parallel applications for distributed-memory and clustere...
Jesper Larsson Träff, William D. Gropp, Rajee...