Sciweavers

2 search results - page 1 / 1
» Introduction and control of subgoals in reinforcement learni...
Sort
View
ICML
2010
IEEE
15 years 21 days ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...