Sciweavers

548 search results - page 18 / 110
» A New Way to Introduce Knowledge into Reinforcement Learning
Sort
View
GECCO
2005
Springer
155views Optimization» more  GECCO 2005»
15 years 5 months ago
Co-evolving recurrent neurons learn deep memory POMDPs
Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient-descent is too slow and unstable for practical use i...
Faustino J. Gomez, Jürgen Schmidhuber
HICSS
2006
IEEE
160views Biometrics» more  HICSS 2006»
15 years 5 months ago
A Case Study of a Longstanding Online Community of Practice Involving Critical Care and Advanced Practice Nurses
The aims of this study are: (1) to examine to what extent critical care and advanced practice nurses’ participation in an online listserv constituted a community of practice, an...
Noriko Hara, Khe Foon Hew
NIPS
2007
15 years 1 months ago
Incremental Natural Actor-Critic Algorithms
We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...
Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...
IWANN
1999
Springer
15 years 4 months ago
Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning
To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...
R. Matthew Kretchmar, Charles W. Anderson
108
Voted
IJCAI
2001
15 years 1 months ago
Reinforcement Learning in Distributed Domains: Beyond Team Games
Using a distributed algorithm rather than a centralized one can be extremely beneficial in large search problems. In addition, the incorporation of machine learning techniques lik...
David Wolpert, Joseph Sill, Kagan Tumer