Sciweavers

ICML 2010
IEEE
13 years 5 months ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...