Search Sciweavers | Sciweavers

68 search results - page 2 / 14

» Feature-Discovering Approximate Value Iteration Methods

196

click to vote

IJCAI
2007

162views Artificial Intelligence» more IJCAI 2007»

Improving Anytime Point-Based Value Iteration Using Principled Point Selections

15 years 8 months ago

Download ai.stanford.edu

Planning in partially-observable dynamical systems (such as POMDPs and PSRs) is a computationally challenging task. Popular approximation techniques that have proved successful ar...

Michael R. James, Michael E. Samples, Dmitri A. Do...

claim paper

Read More »

174

click to vote

AAAI
2008

151views Intelligent Agents» more AAAI 2008»

Generalized Point Based Value Iteration for Interactive POMDPs

15 years 9 months ago

Download www.aaai.org

We develop a point based method for solving finitely nested interactive POMDPs approximately. Analogously to point based value iteration (PBVI) in POMDPs, we maintain a set of bel...

Prashant Doshi, Dennis Perez

claim paper

Read More »

197

click to vote

CORR
2010
Springer

115views Education» more CORR 2010»

The complexity of solving reachability games using value and strategy iteration

15 years 7 months ago

Download www.daimi.au.dk

Concurrent reachability games is a class of games heavily studied by the computer science community, in particular by the formal methods community. Two standard algorithms for app...

Kristoffer Arnsfelt Hansen, Rasmus Ibsen-Jensen, P...

claim paper

Read More »

191

click to vote

TOMACS
2010

79views more TOMACS 2010»

A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

15 years 1 months ago

Download legacy.orie.cornell.edu

In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...

Sumit Kunnumkal, Huseyin Topaloglu

claim paper

Read More »

199

click to vote

JMLR
2008

129views more JMLR 2008»

Finite-Time Bounds for Fitted Value Iteration

15 years 7 months ago

Download www.sztaki.hu

In this paper we develop a theoretical analysis of the performance of sampling-based fitted value iteration (FVI) to solve infinite state-space, discounted-reward Markovian decisi...

Rémi Munos, Csaba Szepesvári

claim paper

Read More »

« Prev « First page 2 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers