Search Sciweavers | Sciweavers

4495 search results - page 4 / 899

» A Performance Monitoring System for Large Computing Clusters

click to vote

ISPAN
2005
IEEE

112views Distributed And Parallel Com...» more ISPAN 2005»

A Scalable Method for Predicting Network Performance in Heterogeneous Clusters

13 years 11 months ago

Download www.cs.virginia.edu

An important requirement for the effective scheduling of parallel applications on large heterogeneous clusters is a current view of system resource availability. Maintaining such ...

Dimitrios Katramatos, Steve J. Chapin

claim paper

Read More »

click to vote

IPPS
2005
IEEE

132views Distributed And Parallel Com...» more IPPS 2005»

Performance Implications of Periodic Checkpointing on Large-Scale Cluster Systems

13 years 11 months ago

Download adam.oliner.net

Large-scale systems like BlueGene/L are susceptible to a number of software and hardware failures that can affect system performance. Periodic application checkpointing is a commo...

Adam J. Oliner, Ramendra K. Sahoo, José E. ...

claim paper

Read More »

click to vote

CCGRID
2003
IEEE

132views Distributed And Parallel Com...» more CCGRID 2003»

Improving Performance via Computational Replication on a Large-Scale Computational Grid

13 years 11 months ago

Download abner.ncat.edu

Yaohang Li, Michael Mascagni

claim paper

Read More »

click to vote

CCGRID
2008
IEEE

122views Distributed And Parallel Com...» more CCGRID 2008»

Scalable Data Gathering for Real-Time Monitoring Systems on Distributed Computing

14 years 19 days ago

Download www.logos.ic.i.u-tokyo.ac.jp

Real-time monitoring is increasingly becoming important in various scenes of large scale, multi-site distributed/parallel computing, e.g, understanding behavior of systems, schedu...

Yoshikazu Kamoshida, Kenjiro Taura

claim paper

Read More »

click to vote

ICDCS
2009
IEEE

155views Distributed And Parallel Com...» more ICDCS 2009»

REMO: Resource-Aware Application State Monitoring for Large-Scale Distributed Systems

14 years 3 months ago

Download www.cc.gatech.edu

To observe, analyze and control large scale distributed systems and the applications hosted on them, there is an increasing need to continuously monitor performance attributes of ...

Shicong Meng, Srinivas R. Kashyap, Chitra Venkatra...

claim paper

Read More »

« Prev « First page 4 / 899 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers