Search Sciweavers | Sciweavers

682 search results - page 84 / 137

» One-Counter Markov Decision Processes


GLOBECOM
2008
IEEE

133 views, Communications

Foresighted Resource Reciprocation Strategies in P2P Networks

16 years 13 days ago

Download medianetlab.ee.ucla.edu

We consider peer-to-peer (P2P) networks, where multiple peers are interested in sharing content. While sharing resources, autonomous and self-interested peers need to make decis...

Hyunggon Park, Mihaela van der Schaar


AIPS
2000

107 views, Artificial Intelligence

On-line Scheduling via Sampling

15 years 7 months ago

Download www.aaai.org

We consider the problem of scheduling an unknown sequence of tasks for a single server as the tasks arrive, with the goal of maximizing the total weighted value of the tasks serv...

Hyeong Soo Chang, Robert Givan, Edwin K. P. Chong


CORR
2006
Springer

113 views, Education

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

15 years 6 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ)...

Manuel Loth, Philippe Preux


DSN
2009
IEEE

131 views, Computer Networks

RRE: A game-theoretic intrusion Response and Recovery Engine

15 years 3 months ago

Download netfiles.uiuc.edu

Preserving the availability and integrity of networked computing systems in the face of fast-spreading intrusions requires advances not only in detection algorithms, but also in a...

Saman A. Zonouz, Himanshu Khurana, William H. Sand...


GECCO
2009
Springer

162 views, Optimization

Uncertainty handling CMA-ES for reinforcement learning

15 years 3 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMA-ES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel
