Sciweavers

274 search results - page 18 / 55
» Network reinforcement
NN
2006
Springer
Neural Networks
14 years 9 months ago
The asymptotic equipartition property in reinforcement learning and its relation to return maximization
We discuss an important property called the asymptotic equipartition property on empirical sequences in reinforcement learning. This states that the typical set of empirical seque...
Kazunori Iwata, Kazushi Ikeda, Hideaki Sakai
IWANN
1999
Springer
15 years 1 months ago
Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning
To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...
R. Matthew Kretchmar, Charles W. Anderson
KBS
2006
14 years 9 months ago
Robot docking based on omnidirectional vision and reinforcement learning
We present a system for visual robotic docking using an omnidirectional camera coupled with the actor-critic reinforcement learning algorithm. The system enables a PeopleBot robot...
David Muse, Cornelius Weber, Stefan Wermter
ISADS
1999
IEEE
15 years 1 months ago
Emergence of Communication for Negotiation by a Recurrent Neural Network
We believe that communication in multi-agent systems has two major meanings. One of them is to transmit one agent's observed information to the other. The other meaning is to ...
Katsunari Shibata, Koji Ito
ACL
1992
14 years 10 months ago
Association-Based Natural Language Processing with Neural Networks
This paper describes a natural language processing system reinforced by the use of association of words and concepts, implemented as a neural network. Combining an associative net...
Kazuhiro Kimura, Takashi Suzuoka, Shin'ya Amano