Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

9

SARA
2005
Springer

favoriteEmaildiscussreport

102views Artificial Intelligence» more SARA 2005»

Feature-Discovering Approximate Value Iteration Methods

13 years 9 months ago

Feature-Discovering Approximate Value Iteration Methods

Download cobweb.ecn.purdue.edu

Sets of features in Markov decision processes can play a critical role ximately representing value and in abstracting the state space. Selection of features is crucial to the success of a system and is most often conducted by a human. We study the problem of automatically selecting problem features, and propose and evaluate a simple approach reducing the problem of selecting a new feature to standard classiﬁcation learning. We learn a classiﬁer that predicts the sign of the Bellman error over a training set of states. By iteratively adding new classiﬁers as features with this method, training between iterations with approximate value iteration, we ﬁnd a Tetris feature set that outperforms randomly constructed features signiﬁcantly, and obtains a score of about three-tenths of the highest score obtained by using a carefully hand-constructed feature set. We also show that features learned with this method outperform those learned with the previous method of Patrascu et al. [4] ...

Jia-Hong Wu, Robert Givan

Real-time Traffic

Approximate Value Iteration | Artificial Intelligence | Markov Decision Processes | Problem Features | SARA 2005 |

claim paper

Related Content

» Iterative method for solving a nonlinear boundary value problem

» Least absolute policy iteration for robust value function approximation

» Efficient exploration through active learning for value function approximation in reinforc...

» Forward Search Value Iteration for POMDPs

» Improving Anytime PointBased Value Iteration Using Principled Point Selections

» Generalized Point Based Value Iteration for Interactive POMDPs

» The complexity of solving reachability games using value and strategy iteration

» A stochastic approximation method with maxnorm projections and its applications to the Qle...

» FiniteTime Bounds for Fitted Value Iteration

Post Info
More Details (n/a)

Added	28 Jun 2010
Updated	28 Jun 2010
Type	Conference
Year	2005
Where	SARA
Authors	Jia-Hong Wu, Robert Givan

Comments (0)