Sciweavers

1233 search results - page 223 / 247
» Feudal Reinforcement Learning
Sort
View
ICML
2010
IEEE
15 years 26 days ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...
IIE
2007
105views more  IIE 2007»
14 years 11 months ago
Student-Centered Support Systems to Sustain Logo-Like Learning
Conventional wisdom attributes the lack of effective technology use in classrooms to a shortage of professional development or poorly run professional development. At the same time...
Sylvia Martinez
ICRA
1994
IEEE
105views Robotics» more  ICRA 1994»
15 years 3 months ago
Harmonic Functions and Collision Probabilities
There is a close relationship between harmonic functions { which have recently been proposed for path planning { and hitting probabilities for random processes. The hitting probab...
Christopher I. Connolly
ROBOCUP
2000
Springer
130views Robotics» more  ROBOCUP 2000»
15 years 3 months ago
Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition
Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...
Yasutake Takahashi, Masanori Takeda, Minoru Asada
ESANN
2008
15 years 1 months ago
Improvement in Game Agent Control Using State-Action Value Scaling
The aim of this paper is to enhance the performance of a reinforcement learning game agent controller, within a dynamic game environment, through the retention of learned informati...
Leo Galway, Darryl Charles, Michaela M. Black