Search Sciweavers | Sciweavers

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

112

click to vote

ICML
2002
IEEE

218views Machine Learning» more ICML 2002»

An epsilon-Optimal Grid-Based Algorithm for Partially Observable Markov Decision Processes

16 years 4 months ago

Download www.cs.ucf.edu

Blai Bonet

claim paper

Read More »

113

click to vote

ECML
2005
Springer

143views Machine Learning» more ECML 2005»

Active Learning in Partially Observable Markov Decision Processes

15 years 8 months ago

Download www.cs.mcgill.ca

This paper examines the problem of ﬁnding an optimal policy for a Partially Observable Markov Decision Process (POMDP) when the model is not known or is only poorly speciﬁed. W...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

138

click to vote

ICML
1994
IEEE

151views Machine Learning» more ICML 1994»

Learning Without State-Estimation in Partially Observable Markovian Decision Processes

15 years 6 months ago

Download www.eecs.umich.edu

Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

« Prev « First page 1 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers