Search Sciweavers | Sciweavers

126

ICANN
2001
Springer

123views Neural Networks» more ICANN 2001»

Market-Based Reinforcement Learning in Partially Observable Worlds

15 years 6 months ago

Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...

Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber

claim paper

Read More »

94

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 6 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

112

click to vote

AAAI
2006

94views Intelligent Agents» more AAAI 2006»

Factored MDP Elicitation and Plan Display

15 years 3 months ago

Download www.aaai.org

The software suite we will demonstrate at AAAI '06 was designed around planning with factored Markov decision processes (MDPs). It is a user-friendly suite that facilitates d...

Krol Kevin Mathias, Casey Lengacher, Derek William...

claim paper

Read More »

92

click to vote

AIPS
2006

161views Artificial Intelligence» more AIPS 2006»

Automated Planning Using Quantum Computation

15 years 3 months ago

Download www.aaai.org

This paper presents an adaptation of the standard quantum search technique to enable application within Dynamic Programming, in order to optimise a Markov Decision Process. This i...

Sanjeev Naguleswaran, Langford B. White, I. Fuss

claim paper

Read More »

96

click to vote

AIPS
2003

149views Artificial Intelligence» more AIPS 2003»

Synthesis of Hierarchical Finite-State Controllers for POMDPs

15 years 3 months ago

Download www.aaai.org

We develop a hierarchical approach to planning for partially observable Markov decision processes (POMDPs) in which a policy is represented as a hierarchical ﬁnite-state control...

Eric A. Hansen, Rong Zhou

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers