Sciweavers

115 search results - page 1 / 23
» Recurrent policy gradients
ICANN
2007
Springer
15 years 5 months ago
Solving Deep Memory POMDPs with Recurrent Policy Gradients
Abstract. This paper presents Recurrent Policy Gradients, a model-free reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...
Daan Wierstra, Alexander Förster, Jan Peters,...
IGPL
2010
14 years 10 months ago
Recurrent policy gradients
Daan Wierstra, Alexander Förster, Jan Peters,...
ICANN
2010
Springer
14 years 12 months ago
Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients
Abstract. Developing superior artificial board-game players is a widely studied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...
Mandy Grüttner, Frank Sehnke, Tom Schaul, J&u...
JMLR
2010
14 years 10 months ago
PyBrain
PyBrain is a versatile machine learning library for Python. Its goal is to provide flexible, easy-to-use yet still powerful algorithms for machine learning tasks, including a vari...
Tom Schaul, Justin Bayer, Daan Wierstra, Yi Sun, M...
JMLR
2010
14 years 6 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...