Search Sciweavers | Sciweavers

548 search results - page 18 / 110

» A New Way to Introduce Knowledge into Reinforcement Learning

click to vote

GECCO
2005
Springer

155views Optimization» more GECCO 2005»

Co-evolving recurrent neurons learn deep memory POMDPs

15 years 7 months ago

Download www.idsia.ch

Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient-descent is too slow and unstable for practical use i...

Faustino J. Gomez, Jürgen Schmidhuber

claim paper

Read More »

click to vote

HICSS
2006
IEEE

160views Biometrics» more HICSS 2006»

A Case Study of a Longstanding Online Community of Practice Involving Critical Care and Advanced Practice Nurses

15 years 8 months ago

Download csdl2.computer.org

The aims of this study are: (1) to examine to what extent critical care and advanced practice nurses’ participation in an online listserv constituted a community of practice, an...

Noriko Hara, Khe Foon Hew

claim paper

Read More »

118

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 3 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

115

click to vote

IWANN
1999
Springer

115views Neural Networks» more IWANN 1999»

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

15 years 6 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

claim paper

Read More »

134

Voted

IJCAI
2001

84views Artificial Intelligence» more IJCAI 2001»

Reinforcement Learning in Distributed Domains: Beyond Team Games

15 years 3 months ago

Download web.engr.oregonstate.edu

Using a distributed algorithm rather than a centralized one can be extremely beneficial in large search problems. In addition, the incorporation of machine learning techniques lik...

David Wolpert, Joseph Sill, Kagan Tumer

claim paper

Read More »

« Prev « First page 18 / 110 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers