Sciweavers

44 search results - page 1 / 9
» Batch reinforcement learning in a complex domain
Sort
View
ATAL
2007
Springer
13 years 10 months ago
Batch reinforcement learning in a complex domain
Temporal difference reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...
Shivaram Kalyanakrishnan, Peter Stone
ECML
2004
Springer
13 years 10 months ago
Batch Reinforcement Learning with State Importance
Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....
Lihong Li, Vadim Bulitko, Russell Greiner
ROBOCUP
2007
Springer
153views Robotics» more  ROBOCUP 2007»
13 years 10 months ago
Model-Based Reinforcement Learning in a Complex Domain
Reinforcement learning is a paradigm under which an agent seeks to improve its policy by making learning updates based on the experiences it gathers through interaction with the en...
Shivaram Kalyanakrishnan, Peter Stone, Yaxin Liu
IAT
2008
IEEE
13 years 4 months ago
Scaling Up Multi-agent Reinforcement Learning in Complex Domains
TD-FALCON (Temporal Difference - Fusion Architecture for Learning, COgnition, and Navigation) is a class of self-organizing neural networks that incorporates Temporal Difference (...
Dan Xiao, Ah-Hwee Tan
NECO
2002
105views more  NECO 2002»
13 years 4 months ago
Multiple Model-Based Reinforcement Learning
We propose a modular reinforcement learning architecture for non-linear, nonstationary control tasks, which we call multiple model-based reinforcement learning (MMRL). The basic i...
Kenji Doya, Kazuyuki Samejima, Ken-ichi Katagiri, ...