Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

155

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

15 years 8 months ago

Counter Example for Q-Bucket-Brigade Under Prediction Problem

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of LCS diverges; and (2) methods to avoid such divergence. Based on our previous work that showed equivalence between LCS’s reinforcement process and Reinforcement Learning (RL) with Function approximation (FA) method, we present a counter-example for LCS with Q-bucketbrigade based on the 11-state star problem, a counterexample originally proposed to show the divergence of Qlearning with linear FA. Furthermore, the empirical results applying the counter-example to LCS veriﬁed the results predicted from the theory: (1) LCS with Q-bucket-brigade diverged under the prediction problem, where the action selection policy was ﬁxed; and (2) such divergence was avoided by using implicit-bucket-brigade or applying residual gradient algorithm to Q-bucket-brigade. Categories and Subject Descriptors I.2.6 [Artiﬁcial ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

Real-time Traffic

IWLCS 2005 | LCS’s Reinforcement Process | Machine Learning | Reinforcement Learning | Reinforcement Process |

claim paper

Related Content

» OneCounter Stochastic Games

» An extremal optimization search method for the protein folding problem the gomodel example

» Automatically countering imbalance and its empirical relationship to cost

» Ensemble Learning with Active Example Selection for Imbalanced Biomedical Data Classificat...

» Discovering local patterns of co evolution computational aspects and biological examples

» Learning and evaluating classifiers under sample selection bias

» Regret Bounds for Prediction Problems

» Robust tubebased MPC for constrained mobile robots under slip conditions

» Module assignment for pinlimited designs under the stackedVdd paradigm

Post Info
More Details (n/a)

Added	28 Jun 2010
Updated	28 Jun 2010
Type	Conference
Year	2005
Where	IWLCS
Authors	Atsushi Wada, Keiki Takadama, Katsunori Shimohara

Comments (0)