Search Sciweavers | Sciweavers

178 search results - page 14 / 36

» Efficient Approximation of Optimal Control for Markov Games

click to vote

AIPS
2009

144views Artificial Intelligence» more AIPS 2009»

Efficient Solutions to Factored MDPs with Imprecise Transition Probabilities

14 years 10 months ago

Download www.ime.usp.br

When modeling real-world decision-theoretic planning problems in the Markov decision process (MDP) framework, it is often impossible to obtain a completely accurate estimate of tr...

Karina Valdivia Delgado, Scott Sanner, Leliane Nun...

claim paper

Read More »

Voted

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

14 years 10 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

105

click to vote

AIPS
2006

211views Artificial Intelligence» more AIPS 2006»

Solving Factored MDPs with Exponential-Family Transition Models

14 years 11 months ago

Download www.cs.pitt.edu

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...

Branislav Kveton, Milos Hauskrecht

claim paper

Read More »

click to vote

CDC
2009
IEEE

132views Control Systems» more CDC 2009»

Q-learning and Pontryagin's Minimum Principle

15 years 2 months ago

Download www.stanford.edu

Abstract— Q-learning is a technique used to compute an optimal policy for a controlled Markov chain based on observations of the system controlled using a non-optimal policy. It ...

Prashant G. Mehta, Sean P. Meyn

claim paper

Read More »

click to vote

CDC
2010
IEEE

167views Control Systems» more CDC 2010»

Numerical methods for the optimization of nonlinear stochastic delay systems, and an application to internet regulation

14 years 4 months ago

Download www.dam.brown.edu

The Markov chain approximation method is an effective and widely used approach for computing optimal values and controls for stochastic systems. It was extended to nonlinear (and p...

Harold J. Kushner

claim paper

Read More »

« Prev « First page 14 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers