Search Sciweavers | Sciweavers

PPOPP
2011
ACM

230 views, Distributed and Parallel Com...

GRace: a low-overhead mechanism for detecting data races in GPU programs

14 years 2 months ago

In recent years, GPUs have emerged as an extremely cost-effective means for achieving high performance. Many application developers, including those with no prior parallel program...

Mai Zheng, Vignesh T. Ravi, Feng Qin, Gagan Agrawa...

CGO
2008
IEEE

69 views, Software Engineering

Latency-tolerant software pipelining in a production compiler

15 years 6 months ago

Download rw4.cs.uni-sb.de

In this paper we investigate the benefit of scheduling non-critical loads for a higher latency during software pipelining. "Noncritical" denotes those loads that have s...

Sebastian Winkel, Rakesh Krishnaiyer, Robyn Sampso...

KDD
2009
ACM

198 views, Data Mining

Pervasive parallelism in data mining: dataflow solution to co-clustering large and sparse Netflix data

16 years 4 days ago

Download www.pervasivedatarush.com

All Netflix Prize algorithms proposed so far are prohibitively costly for large-scale production systems. In this paper, we describe an efficient dataflow implementation of a coll...

Srivatsava Daruru, Nena M. Marin, Matt Walker, Joy...

ISCA
2008
IEEE

136 views, Hardware

The Design and Performance of a Bare PC Web Server

14 years 11 months ago

Download triton.towson.edu

There is an increasing need for new Web server architectures that are application-centric, simple, small, and pervasive in nature. In this paper, we present a novel architecture f...

Long He, Ramesh K. Karne, Alexander L. Wijesinha

PDCAT
2009
Springer

243 views, Distributed And Parallel Com...

CheCUDA: A Checkpoint/Restart Tool for CUDA Applications

15 years 6 months ago

Download www.sc.isc.tohoku.ac.jp

In this paper, a tool named CheCUDA is designed to checkpoint CUDA applications that use GPUs as accelerators. As existing checkpoint/restart implementations do not supp...

Hiroyuki Takizawa, Katsuto Sato, Kazuhiko Komatsu,...
