Search Sciweavers | Sciweavers

162 search results - page 12 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...

149

click to vote

AIPS
2007

80views Artificial Intelligence» more AIPS 2007»

Prioritizing Bellman Backups without a Priority Queue

15 years 8 months ago

Download www.cs.washington.edu

Several researchers have shown that the efﬁciency of value iteration, a dynamic programming algorithm for Markov decision processes, can be improved by prioritizing the order of...

Peng Dai, Eric A. Hansen

claim paper

Read More »

168

click to vote

ICML
2006
IEEE

136views Machine Learning» more ICML 2006»

An analytic solution to discrete Bayesian reinforcement learning

16 years 6 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

claim paper

Read More »

193

click to vote

ICML
2006
IEEE

256views Machine Learning» more ICML 2006»

Automatic basis function construction for approximate dynamic programming and reinforcement learning

16 years 1 hour ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

claim paper

Read More »

130

click to vote

NIPS
2003

196views Information Technology» more NIPS 2003»

Approximate Policy Iteration with a Policy Language Bias

15 years 7 months ago

Download www.jair.org

We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...

Alan Fern, Sung Wook Yoon, Robert Givan

claim paper

Read More »

194

click to vote

ICTAI
2010
IEEE

226views Artificial Intelligence» more ICTAI 2010»

A Closer Look at MOMDPs

15 years 4 months ago

Download www.loria.fr

Abstract--The difficulties encountered in sequential decisionmaking problems under uncertainty are often linked to the large size of the state space. Exploiting the structure of th...

Mauricio Araya-López, Vincent Thomas, Olivi...

claim paper

Read More »

« Prev « First page 12 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers