ICML
2008
IEEE

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge against actions that increase an agent's reward. Unfortunately, most POMDPs are defined with a large number of parameters that are difficult to specify from domain knowledge alone. In this paper, we present an approximation approach that allows us to treat the POMDP model parameters as additional hidden state in a "model-uncertainty" POMDP. Coupled with model-directed queries, our planner actively learns good policies. We demonstrate our approach on several POMDP problems.
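
The idea of treating model parameters as extra hidden state, and asking a query only when acting is too risky, can be illustrated with a small sketch. This is not the authors' implementation: it assumes a finite set of candidate models, a belief weight per model, and precomputed per-model action values. The names `bayes_risk`, `choose_action`, `update_model_belief`, `q_values`, and `query_cost` are all hypothetical and introduced only for illustration.

```python
from typing import List, Sequence, Union


def bayes_risk(q_values: Sequence[Sequence[float]],
               weights: Sequence[float]) -> List[float]:
    """Expected loss of each action under a belief over candidate models.

    q_values[i][a] is an assumed value estimate for action a under model i;
    weights[i] is the current belief that model i is the true model.
    """
    n_actions = len(q_values[0])
    risks = []
    for a in range(n_actions):
        # Loss under model i: shortfall of action a relative to that
        # model's own best action, weighted by the belief in model i.
        risk = sum(w * (max(q) - q[a]) for q, w in zip(q_values, weights))
        risks.append(risk)
    return risks


def choose_action(q_values: Sequence[Sequence[float]],
                  weights: Sequence[float],
                  query_cost: float) -> Union[int, str]:
    """Return the least-risky action index, or "query" if even that
    action's expected loss exceeds the (assumed) cost of asking."""
    risks = bayes_risk(q_values, weights)
    best = min(range(len(risks)), key=risks.__getitem__)
    return "query" if risks[best] > query_cost else best


def update_model_belief(weights: Sequence[float],
                        likelihoods: Sequence[float]) -> List[float]:
    """Bayes rule over candidate models: reweight each model by how well
    it explains the latest evidence (observation or query answer)."""
    unnormalized = [w * l for w, l in zip(weights, likelihoods)]
    total = sum(unnormalized)
    if total == 0:
        raise ValueError("evidence has zero likelihood under every model")
    return [u / total for u in unnormalized]
```

With two models that disagree about which of two actions is best and an even belief over them, every action carries high expected loss, so the sketch asks a query; once the belief concentrates on one model, it acts instead.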

Finale Doshi, Joelle Pineau, Nicholas Roy

Additional Hidden State | ICML 2008 | Machine Learning | Observable Markov Decision | POMDP Model Parameters

Related Content

» Could Active Perception Aid Navigation of Partially Observable Grid Worlds

» Bayesian reinforcement learning for POMDP-based dialogue systems

» Selecting actions for resource-bounded information extraction using reinforcement learning

» Advanced Metrics for Class-Driven Similarity Search

» Selecting Operator Queries Using Expected Myopic Gain

» Teaching a Robot to Perform Tasks with Voice Commands

Post Info

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2008
Where	ICML
Authors	Finale Doshi, Joelle Pineau, Nicholas Roy
