Sciweavers


ML
2002
ACM


Structure in the Space of Value Functions


Download www.gatsby.ucl.ac.uk

Solving many different optimal control tasks efficiently within the same underlying environment requires decomposing the environment into its computationally elemental fragments. We suggest how to find such fragmentations by applying unsupervised mixture-model learning methods to data derived from optimal value functions for multiple tasks, and show that these fragmentations accord with observable structure in the environments. Further, we present evidence that such fragments can be useful in a practical reinforcement learning context by facilitating online actor-critic learning of multiple-goal MDPs.
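As a rough illustration of the idea in the abstract, the sketch below computes optimal value functions for several goals in one environment and then groups states by their value profiles across those goals. It is not the authors' procedure: the two-room grid, the goal positions, the step cost of -1, the discount factor, and all function names are assumptions made for this example, and a plain two-cluster k-means (a hard-assignment simplification) stands in for the mixture-model learning the abstract mentions.

```python
# Hypothetical two-room gridworld: '#' is a wall, '.' is a free cell.
GRID = [
    "#########",
    "#...#...#",
    "#.......#",
    "#...#...#",
    "#########",
]
STATES = [(r, c) for r, row in enumerate(GRID)
          for c, ch in enumerate(row) if ch == "."]
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]
GOALS = [(1, 1), (3, 3), (1, 7), (3, 5)]  # assumed set of tasks


def step(state, action):
    """Deterministic move; bumping into a wall leaves the state unchanged."""
    r, c = state[0] + action[0], state[1] + action[1]
    return (r, c) if GRID[r][c] == "." else state


def optimal_values(goal, gamma=0.95, tol=1e-9):
    """Value iteration with reward -1 per step and an absorbing goal."""
    V = {s: 0.0 for s in STATES}
    while True:
        delta = 0.0
        for s in STATES:
            if s == goal:
                continue  # goal value stays fixed at 0
            best = max(-1.0 + gamma * V[step(s, a)] for a in ACTIONS)
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            return V


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def two_means(points, iters=50):
    """Two-cluster k-means, seeded with the first and last points."""
    centroids = [list(points[0]), list(points[-1])]
    labels = [0] * len(points)
    for _ in range(iters):
        new_labels = [min((0, 1), key=lambda j: sq_dist(p, centroids[j]))
                      for p in points]
        for j in (0, 1):
            members = [p for p, lab in zip(points, new_labels) if lab == j]
            if members:  # keep the old centroid if a cluster empties
                centroids[j] = [sum(col) / len(members)
                                for col in zip(*members)]
        if new_labels == labels:
            break
        labels = new_labels
    return labels


# One optimal value function per task, then one value profile per state.
VALUES = {g: optimal_values(g) for g in GOALS}
profiles = [[VALUES[g][s] for g in GOALS] for s in STATES]
labels = two_means(profiles)

for s, lab in zip(STATES, labels):
    print(s, "-> fragment", lab)
```

Whether the two clusters line up with the two rooms depends on the layout and on which goals are chosen; the sketch only shows the pipeline of solving several tasks and then clustering states on the resulting values.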

David J. Foster, Peter Dayan


Efficient Manner | Machine Learning | ML 2002 | Optimal Control Tasks | Optimal Value Functions


Related Content

» On the Construction of Initial Basis Function for Efficient Value Function Approximation

» Characterizing the Space of interatomic Distance Distribution Functions Consistent with So...

» Optimal Coalition Structure Generation In Partition Function Games

» Checking Value-Sensitive Data Structures in Sublinear Space

» Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping

» Mod-p Decision Diagrams: A Data Structure for Multiple-Valued Functions

» Multiagent Planning with Factored MDPs

» Proto-value functions: developmental reinforcement learning

» Constructing basis functions from directed graphs for value function approximation

Post Info

Added	22 Dec 2010
Updated	22 Dec 2010
Type	Journal
Year	2002
Where	ML
Authors	David J. Foster, Peter Dayan
