Sciweavers

116

TOMACS
2010

79views more TOMACS 2010»

A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

14 years 9 months ago

In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...

Sumit Kunnumkal, Huseyin Topaloglu

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers