Search Sciweavers | Sciweavers

33 search results - page 7 / 7

» Tree Based Discretization for Continuous State Space Reinfor...

AIPS
2007

174 views, Artificial Intelligence

Learning to Plan Using Harmonic Analysis of Diffusion Models

13 years 7 months ago

Download www.cs.umass.edu

This paper summarizes research on a new emerging framework for learning to plan using the Markov decision process model (MDP). In this paradigm, two approaches to learning to plan...

Sridhar Mahadevan, Sarah Osentoski, Jeffrey Johns,...

AAAI
2006

118 views, Intelligent Agents

Hard Constrained Semi-Markov Decision Processes

13 years 6 months ago

Download www.aaai.org

In multiple criteria Markov Decision Processes (MDP) where multiple costs are incurred at every decision point, current methods solve them by minimising the expected primary cost ...

Wai-Leong Yeow, Chen-Khong Tham, Wai-Choong Wong

UAI
2000

133 views, Artificial Intelligence

PEGASUS: A policy search method for large MDPs and POMDPs

13 years 6 months ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan
