Sciweavers

14 search results - page 3 / 3
» Sensitive Discount Optimality: Unifying Discounted and Avera...
Sort
View
NECO
2007
87views more  NECO 2007»
13 years 4 months ago
Reinforcement Learning State Estimator
cal networks in the learning of abstract and effector-specific representations of motor sequences. Neuroimage. 32, 714-727. (Neuroimage Editor’s Choice Award, 2006) Daw, N. D. Do...
Jun Morimoto, Kenji Doya
IWANN
1999
Springer
13 years 9 months ago
Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning
To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...
R. Matthew Kretchmar, Charles W. Anderson
UAI
2003
13 years 6 months ago
On the Convergence of Bound Optimization Algorithms
Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...
Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...
IIE
2007
63views more  IIE 2007»
13 years 5 months ago
Investigation of Q-Learning in the Context of a Virtual Learning Environment
We investigate the possibility to apply a known machine learning algorithm of Q-learning in the domain of a Virtual Learning Environment (VLE). It is important in this problem doma...
Dalia Baziukaite