Sciweavers

GECCO
2004
Springer

Improving MACS Thanks to a Comparison with 2TBNs

13 years 10 months ago
Improving MACS Thanks to a Comparison with 2TBNs
Abstract. Factored Markov Decision Processes is the theoretical framework underlying multi-step Learning Classifier Systems research. This framework is mostly used in the context of Two-stage Bayes Networks, a subset of Bayes Networks. In this paper, we compare the Learning Classifier Systems approach and the Bayes Networks approach to factored Markov Decision Problems. More specifically, we focus on a comparison between MACS, an Anticipatory Learning Classifier System, and Structured Policy Iteration, a general planning algorithm used in the context of Two-stage Bayes Networks. From that comparison, we define a new algorithm resulting from the adaptation of Structured Policy Iteration to the context of MACS. We conclude by calling for a closer communication between both research communities.
Olivier Sigaud, Thierry Gourdin, Pierre-Henri Wuil
Added 01 Jul 2010
Updated 01 Jul 2010
Type Conference
Year 2004
Where GECCO
Authors Olivier Sigaud, Thierry Gourdin, Pierre-Henri Wuillemin
Comments (0)