Sciweavers

5523 search results - page 393 / 1105
» Improving application performance with hardware data structu...
Sort
View
VISSYM
2004
15 years 7 months ago
Occlusion Culling for Sub-Surface Models in Geo-Scientific Applications
We present a three pass occlusion culling algorithm, which makes efficient use of hardware support. Our geo-scientific sub-surface data sets consist typically of a set of high res...
John Plate, Anselm Grundhöfer, Benjamin Schmi...
ISCA
2010
IEEE
405views Hardware» more  ISCA 2010»
15 years 11 months ago
Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU
Recent advances in computing have led to an explosion in the amount of data being generated. Processing the ever-growing data in a timely manner has made throughput computing an i...
Victor W. Lee, Changkyu Kim, Jatin Chhugani, Micha...
PPOPP
2010
ACM
16 years 3 months ago
Scalable communication protocols for dynamic sparse data exchange
Many large-scale parallel programs follow a bulk synchronous parallel (BSP) structure with distinct computation and communication phases. Although the communication phase in such ...
Torsten Hoefler, Christian Siebert, Andrew Lumsdai...
VTC
2008
IEEE
102views Communications» more  VTC 2008»
16 years 17 days ago
Two-Level Early Stopping Algorithm for LTE Turbo Decoding
—The design of LTE turbo coding chain suitable for flexible parallel and pipelined hardware implementations is presented. The hierarchical data structure further offers an opport...
Jung-Fu Cheng
137
Voted
ASAP
2007
IEEE
169views Hardware» more  ASAP 2007»
16 years 17 days ago
Reduced Delay BCD Adder
Financial and commercial applications use decimal data and spend most of their time in decimal arithmetic. Software implementation of decimal arithmetic is typically at least 100 ...
A. A. Bayrakci, A. Akkas