Sciweavers

51 search results - page 2 / 11
» Improving Approximate Value Iteration Using Memories and Pre...
Sort
View

Publication
222views
14 years 1 months ago
Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration
Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...
Christos Dimitrakakis, Michail G. Lagoudakis
ICAI
2009
13 years 2 months ago
On the Construction of Initial Basis Function for Efficient Value Function Approximation
- We address the issues of improving the feature generation methods for the value-function approximation and the state space approximation. We focus the improvement of feature gene...
Chung-Cheng Chiu, Kuan-Ta Chen
SIGMETRICS
2000
ACM
105views Hardware» more  SIGMETRICS 2000»
13 years 9 months ago
Using the exact state space of a Markov model to compute approximate stationary measures
We present a new approximation algorithm based on an exact representation of the state space S, using decision diagrams, and of the transition rate matrix R, using Kronecker algeb...
Andrew S. Miner, Gianfranco Ciardo, Susanna Donate...
PKDD
2009
Springer
152views Data Mining» more  PKDD 2009»
13 years 11 months ago
Feature Selection for Value Function Approximation Using Bayesian Model Selection
Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of th...
Tobias Jung, Peter Stone

Publication
334views
14 years 1 months ago
Rollout Sampling Approximate Policy Iteration
Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schem...
Christos Dimitrakakis, Michail G. Lagoudakis