Search Sciweavers | Sciweavers

656 search results - page 35 / 132

» Complexity of finite-horizon Markov decision process problem...

AAAI
2004

103 views, Intelligent Agents

Stochastic Local Search for POMDP Controllers

15 years 3 months ago

Download www.cs.utoronto.ca

The search for finite-state controllers for partially observable Markov decision processes (POMDPs) is often based on approaches like gradient ascent, attractive because of their ...

Darius Braziunas, Craig Boutilier

CORR
2006
Springer

113 views, Education

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

15 years 1 month ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ)...

Manuel Loth, Philippe Preux

IPCO
2008

114 views, Optimization

The Stochastic Machine Replenishment Problem

15 years 3 months ago

Download www.cs.duke.edu

We study the stochastic machine replenishment problem, which is a canonical special case of closed multiclass queuing systems in Markov decision theory. The problem models the sche...

Kamesh Munagala, Peng Shi

ICASSP
2011
IEEE

153 views, Signal Processing

Reinforcement learning for energy-efficient wireless transmission

14 years 5 months ago

Download mirlab.org

We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g. multimedia data) over a fading channel. We propose a rigorous and unified fra...

Nicholas Mastronarde, Mihaela van der Schaar

ICML
2001
IEEE

172 views, Machine Learning

Continuous-Time Hierarchical Reinforcement Learning

16 years 2 months ago

Download www.cs.ualberta.ca

Hierarchical reinforcement learning (RL) is a general framework which studies how to exploit the structure of actions and tasks to accelerate policy learning in large domains. Pri...

Mohammad Ghavamzadeh, Sridhar Mahadevan
