Search Sciweavers | Sciweavers

150

NN
2006
Springer

127views Neural Networks» more NN 2006»

The asymptotic equipartition property in reinforcement learning and its relation to return maximization

15 years 5 months ago

We discuss an important property called the asymptotic equipartition property on empirical sequences in reinforcement learning. This states that the typical set of empirical seque...

Kazunori Iwata, Kazushi Ikeda, Hideaki Sakai

claim paper

Read More »

161

click to vote

IWANN
1999
Springer

115views Neural Networks» more IWANN 1999»

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

15 years 10 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

claim paper

Read More »

146

click to vote

KBS
2006

105views more KBS 2006»

Robot docking based on omnidirectional vision and reinforcement learning

15 years 5 months ago

Download www.eecs.wsu.edu

We present a system for visual robotic docking using an omnidirectional camera coupled with the actor critic reinforcement learning algorithm. The system enables a PeopleBot robot...

David Muse, Cornelius Weber, Stefan Wermter

claim paper

Read More »

157

click to vote

ISADS
1999
IEEE

81views Emerging Technology» more ISADS 1999»

Emergence of Communication for Negotiation by a Recurrent Neural Network

15 years 10 months ago

Download shws.cc.oita-u.ac.jp

We believe that communication in multi-agent system has two major meanings. One of them is to transmit one agent's observed information to the other. The other meaning is to ...

Katsunari Shibata, Koji Ito

claim paper

Read More »

163

click to vote

ACL
1992

148views Computational Linguistics» more ACL 1992»

Association-Based Natural Language Processing with Neural Networks

15 years 6 months ago

Download www.mt-archive.info

This paper describes a natural language processing system reinforced by the use of association of words and concepts, implemented as a neural network. Combining an associative net...

Kazuhiro Kimura, Takashi Suzuoka, Shin'ya Amano

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers