Search Sciweavers | Sciweavers

ICPR
2004
IEEE

170 views, Computer Vision

Improvement of Bidirectional Recurrent Neural Network for Learning Long-Term Dependencies

14 years 6 months ago

A bidirectional recurrent neural network (BRNN) is a noncausal generalization of the recurrent neural network (RNN). It cannot learn remote information efficiently due to the problem of ...

Jinmiao Chen, Narendra S. Chaudhari


CORR
2006
Springer

113 views, Education

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

13 years 5 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ)...

Manuel Loth, Philippe Preux


IDEAL
2004
Springer

94 views, Intelligent Agents

Policy Gradient Method for Team Markov Games

13 years 10 months ago

Download www.cis.hut.fi

The main aim of this paper is to extend the single-agent policy gradient method to multiagent domains where all agents share the same utility function. We formulate these team pro...

Ville Könönen


NECO
2010

97 views

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

13 years 3 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...


ESANN
2000

152 views, Neural Networks

An algorithm for the addition of time-delayed connections to recurrent neural networks

13 years 6 months ago

Download www.dice.ucl.ac.be

Recurrent neural networks possess interesting universal approximation capabilities, making them good candidates for time series modeling. Unfortunately, long-term dependencies ar...

Romuald Boné, Michel Crucianu, Jean Pierre ...
