Sciweavers

343 search results - page 49 / 69
» Action discovery for reinforcement learning
Sort
View
IJON
2007
73views more  IJON 2007»
14 years 9 months ago
Affordances, effectivities, and assisted imitation: Caregivers and the directing of attention
We focus on how infants’ discovery of a range of affordances and effectivities contributes to participating in a new activity. We emphasize how caregivers bracket ongoing action...
Patricia Zukow-Goldring, Michael A. Arbib
ECML
2006
Springer
15 years 1 months ago
Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks
Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...
Sébastien Jodogne, Cyril Briquet, Justus H....
IJCAI
2003
14 years 11 months ago
Simultaneous Adversarial Multi-Robot Learning
Multi-robot learning faces all of the challenges of robot learning with all of the challenges of multiagent learning. There has been a great deal of recent research on multiagent ...
Michael H. Bowling, Manuela M. Veloso
ICML
2004
IEEE
15 years 10 months ago
Learning and discovery of predictive state representations in dynamical systems with reset
Predictive state representations (PSRs) are a recently proposed way of modeling controlled dynamical systems. PSR-based models use predictions of observable outcomes of tests that...
Michael R. James, Satinder P. Singh
ISADS
1999
IEEE
15 years 2 months ago
Emergence of Communication for Negotiation by a Recurrent Neural Network
We believe that communication in multi-agent system has two major meanings. One of them is to transmit one agent's observed information to the other. The other meaning is to ...
Katsunari Shibata, Koji Ito