Search Sciweavers | Sciweavers

115 search results - page 1 / 23

» Recurrent policy gradients

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 8 months ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

Voted

IGPL
2010

83views more IGPL 2010»

Recurrent policy gradients

15 years 11 days ago

Download www.idsia.ch

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

112

click to vote

ICANN
2010
Springer

164views Neural Networks» more ICANN 2010»

Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients

15 years 2 months ago

Download www.idsia.ch

Abstract. Developing superior artificial board-game players is a widelystudied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...

Mandy Grüttner, Frank Sehnke, Tom Schaul, J&u...

claim paper

Read More »

136

click to vote

JMLR
2010

227views more JMLR 2010»

PyBrain

15 years 9 days ago

Download www.idsia.ch

PyBrain is a versatile machine learning library for Python. Its goal is to provide ﬂexible, easyto-use yet still powerful algorithms for machine learning tasks, including a vari...

Tom Schaul, Justin Bayer, Daan Wierstra, Yi Sun, M...

claim paper

Read More »

150

Voted

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 8 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

« Prev « First page 1 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers