Sciweavers

286 search results - page 2 / 58
» Using inaccurate models in reinforcement learning
Sort
View
ICML
2008
IEEE
14 years 6 months ago
Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs
Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...
Finale Doshi, Joelle Pineau, Nicholas Roy
AIIDE
2008
13 years 7 months ago
Combining Model-Based Meta-Reasoning and Reinforcement Learning for Adapting Game-Playing Agents
Human experience with interactive games will be enhanced if the software agents that play the game learn from their failures. Techniques such as reinforcement learning provide one...
Patrick Ulam, Joshua Jones, Ashok K. Goel
WSC
2007
13 years 7 months ago
Optimizing time warp simulation with reinforcement learning techniques
Adaptive Time Warp protocols in the literature are usually based on a pre-defined analytic model of the system, expressed as a closed form function that maps system state to cont...
Jun Wang, Carl Tropper
ROBOCUP
2007
Springer
153views Robotics» more  ROBOCUP 2007»
13 years 11 months ago
Model-Based Reinforcement Learning in a Complex Domain
Reinforcement learning is a paradigm under which an agent seeks to improve its policy by making learning updates based on the experiences it gathers through interaction with the en...
Shivaram Kalyanakrishnan, Peter Stone, Yaxin Liu
ATAL
2006
Springer
13 years 9 months ago
Rule value reinforcement learning for cognitive agents
RVRL (Rule Value Reinforcement Learning) is a new algorithm which extends an existing learning framework that models the environment of a situated agent using a probabilistic rule...
Christopher Child, Kostas Stathis