Sciweavers

» ABC Reinforcement Learning
TSMC
2008
14 years 11 months ago
Improved Adaptive-Reinforcement Learning Control for Morphing Unmanned Air Vehicles
This paper presents an improved Adaptive...
John Valasek, James Doebbler, Monish D. Tandale, A...
IWLCS
2005
Springer
15 years 5 months ago
Counter Example for Q-Bucket-Brigade Under Prediction Problem
Aiming to clarify the convergence or divergence conditions for Learning Classifier System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...
Atsushi Wada, Keiki Takadama, Katsunori Shimohara
ICML
2006
IEEE
16 years 20 days ago
PAC model-free reinforcement learning
For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...
Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...
AGENTS
2001
Springer
15 years 4 months ago
Using background knowledge to speed reinforcement learning in physical agents
This paper describes Icarus, an agent architecture that embeds a hierarchical reinforcement learning algorithm within a language for specifying agent behavior. An Icarus program e...
Daniel G. Shapiro, Pat Langley, Ross D. Shachter
AAAI
1997
15 years 1 months ago
Reinforcement Learning with Time
This paper steps back from the standard infinite horizon formulation of reinforcement learning problems to consider the simpler case of finite horizon problems. Although finite ho...
Daishi Harada