Convergence of synchronous reinforcement learning with linear function approximation

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merke, 2003). In this paper we state conditions of convergence for general inhomogeneous matrix iterations and prove that they are both necessary and sufficient. This result extends the work of Schoknecht and Merke (2003), where only a sufficient condition of convergence was proved. Because the condition is both necessary and sufficient, the new result can be used to prove both convergence and divergence of RL algorithms with function approximation. We use the theorem to derive a new, concise proof of convergence for the synchronous residual gradient algorithm (Baird, 1995). Moreover, we derive a counterexample for which the uniform RL algorithm (Merke & Schoknecht, 2002) diverges. This yields a negative answer to the open question of whether the uniform RL algorithm converges for arbitrary multiple trans...
Artur Merke, Ralf Schoknecht
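The "inhomogeneous matrix iteration" view underlying the abstract can be made concrete with a minimal sketch: an affine iteration x_{k+1} = A x_k + b. The matrices A and b below are hypothetical illustration values, not taken from the paper, and the sketch only checks the classical sufficient condition for convergence, spectral radius ρ(A) < 1, rather than the paper's necessary-and-sufficient condition for its special form.

```python
import numpy as np

def spectral_radius(A: np.ndarray) -> float:
    """Largest eigenvalue magnitude of A."""
    return float(np.max(np.abs(np.linalg.eigvals(A))))

def iterate_affine(A: np.ndarray, b: np.ndarray, x0: np.ndarray,
                   steps: int = 1000) -> np.ndarray:
    """Run the inhomogeneous matrix iteration x_{k+1} = A x_k + b."""
    x = x0.astype(float)
    for _ in range(steps):
        x = A @ x + b
    return x

# Hypothetical 2x2 example (not from the paper). Here rho(A) < 1, so the
# iteration converges from any x0 to the fixed point x* = (I - A)^{-1} b.
A = np.array([[0.5, 0.2],
              [0.1, 0.4]])
b = np.array([1.0, -1.0])

print(f"spectral radius = {spectral_radius(A):.3f}")  # 0.600 < 1

x_inf = iterate_affine(A, b, x0=np.zeros(2))
x_star = np.linalg.solve(np.eye(2) - A, b)
print("iterate:    ", x_inf)   # should agree with x_star
print("fixed point:", x_star)
```

When ρ(A) ≥ 1, the same iteration can diverge, which is the kind of behavior the paper's counterexample exhibits for the uniform RL algorithm; the paper's contribution is a condition that exactly separates the two cases for iterations of its special form.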
Added: 17 Nov 2009
Updated: 17 Nov 2009
Type: Conference
Year: 2004
Where: ICML
Authors: Artur Merke, Ralf Schoknecht