Search Sciweavers | Sciweavers

162 search results - page 17 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...


ICRA
2008
IEEE


An approximate algorithm for solving oracular POMDPs

16 years 14 days ago

Download www.cs.cmu.edu

Abstract— We propose a new approximate algorithm, LAJIV (Lookahead J-MDP Information Value), to solve Oracular Partially Observable Markov Decision Problems (OPOMDPs), a special ...

Nicholas Armstrong-Crews, Manuela M. Veloso



ATAL
2006
Springer


On the relationship between MDPs and the BDI architecture

15 years 9 months ago

Download www.sci.brooklyn.cuny.edu

In this paper we describe the initial results of an investigation into the relationship between Markov Decision Processes (MDPs) and Belief-Desire-Intention (BDI) architectures. W...

Gerardo I. Simari, Simon Parsons



PKDD
2009
Springer


Considering Unseen States as Impossible in Factored Reinforcement Learning

16 years 18 days ago

Download www-desir.lip6.fr

Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...



CORR
2012
Springer


A Faster Algorithm for Solving One-Clock Priced Timed Games

14 years 1 months ago

Download www.daimi.au.dk

One-clock priced timed games is a class of two-player, zero-sum, continuous-time games that was defined and thoroughly studied in previous works. We show that One-clock priced ti...

Thomas Dueholm Hansen, Rasmus Ibsen-Jensen, Peter ...



NIPS
1996


Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 7 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

