Sciweavers

3084 search results - page 127 / 617
» Learning to Take Actions
Sort
View
IADIS
2003
14 years 11 months ago
E-Blended Learning for Distance Learners
E-blended learning as a new methodology will be explained. E-blended learning scenario for distance learners will include live sessions. During the last years we developed e-learn...
Jeanne Schreurs
ICML
2010
IEEE
14 years 11 months ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...
ICML
2000
IEEE
15 years 10 months ago
Eligibility Traces for Off-Policy Policy Evaluation
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
IUI
1997
ACM
15 years 2 months ago
Inductive Task Modeling for User Interface Customization
This paper describes ActionStreams, a system for inducing task models from observations of user activity. The model can represent several task structures: hierarchy, variable sequ...
David Maulsby
ICML
2003
IEEE
15 years 10 months ago
Hierarchical Policy Gradient Algorithms
Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...
Mohammad Ghavamzadeh, Sridhar Mahadevan