Search Sciweavers | Sciweavers

14 search results - page 2 / 3

» Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

click to vote

ICML
2008
IEEE

165views Machine Learning» more ICML 2008»

A worst-case comparison between temporal difference and residual gradient with linear function approximation

14 years 5 months ago

Download www.research.rutgers.edu

Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...

Lihong Li

claim paper

Read More »

click to vote

SPAA
2004
ACM

70views Distributed And Parallel Com...» more SPAA 2004»

Packet-mode policies for input-queued switches

13 years 10 months ago

Download www.cs.technion.ac.il

This paper considers the problem of packet-mode scheduling of input queuedswitches. Packets have variable lengths, and are divided into cells of unit length. Each packet arrives t...

Dan Guez, Alexander Kesselman, Adi Rosén

claim paper

Read More »

click to vote

GECCO
2009
Springer

162views Optimization» more GECCO 2009»

Uncertainty handling CMA-ES for reinforcement learning

13 years 2 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

click to vote

CN
2004

109views more CN 2004»

Modeling correlations in web traces and implications for designing replacement policies

13 years 4 months ago

Download www.stanford.edu

A number of web cache-related algorithms, such as replacement and prefetching policies, rely on specific characteristics present in the sequence of requests for efficient performa...

Konstantinos Psounis, An Zhu, Balaji Prabhakar, Ra...

claim paper

Read More »

click to vote

TON
2008

155views more TON 2008»

A comparative analysis of server selection in content replication networks

13 years 4 months ago

Download people.bu.edu

Server selection plays an essential role in content replication networks, such as peer-to-peer (P2P) and content delivery networks (CDNs). In this paper, we perform an analytical i...

Tao Wu, David Starobinski

claim paper

Read More »

« Prev « First page 2 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers