Sciweavers

ESANN
2006

114views Neural Networks» more ESANN 2006»

Reducing policy degradation in neuro-dynamic programming

13 years 10 months ago

We focus on neuro-dynamic programming methods to learn state-action value functions and outline some of the inherent problems to be faced, when performing reinforcement learning in...

Thomas Gabel, Martin Riedmiller

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers