Improving MACS Thanks to a Comparison with 2TBNs

15 years 11 months ago

Download www.cs.york.ac.uk

Abstract. Factored Markov Decision Processes is the theoretical framework underlying multi-step Learning Classiﬁer Systems research. This framework is mostly used in the context of Two-stage Bayes Networks, a subset of Bayes Networks. In this paper, we compare the Learning Classiﬁer Systems approach and the Bayes Networks approach to factored Markov Decision Problems. More speciﬁcally, we focus on a comparison between MACS, an Anticipatory Learning Classiﬁer System, and Structured Policy Iteration, a general planning algorithm used in the context of Two-stage Bayes Networks. From that comparison, we deﬁne a new algorithm resulting from the adaptation of Structured Policy Iteration to the context of MACS. We conclude by calling for a closer communication between both research communities.

Olivier Sigaud, Thierry Gourdin, Pierre-Henri Wuil

Real-time Traffic

Bayes Networks | GECCO 2004 | Learning Classiﬁer System | Two-stage Bayes Networks |

claim paper

Post Info
More Details (n/a)

Added	01 Jul 2010
Updated	01 Jul 2010
Type	Conference
Year	2004
Where	GECCO
Authors	Olivier Sigaud, Thierry Gourdin, Pierre-Henri Wuillemin

Comments (0)

Sciweavers

Improving MACS Thanks to a Comparison with 2TBNs

Bayes Networks | GECCO 2004 | Learning Classiﬁer System | Two-stage Bayes Networks |

Explore & Download

Productivity Tools

Sciweavers