Search Sciweavers | Sciweavers

343 search results - page 49 / 69

» Action discovery for reinforcement learning

click to vote

IJON
2007

73views more IJON 2007»

Affordances, effectivities, and assisted imitation: Caregivers and the directing of attention

14 years 11 months ago

Download www.mentaldev.org

We focus on how infants’ discovery of a range of affordances and effectivities contributes to participating in a new activity. We emphasize how caregivers bracket ongoing action...

Patricia Zukow-Goldring, Michael A. Arbib

claim paper

Read More »

click to vote

ECML
2006
Springer

141views Machine Learning» more ECML 2006»

Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

15 years 3 months ago

Download www.montefiore.ulg.ac.be

Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...

Sébastien Jodogne, Cyril Briquet, Justus H....

claim paper

Read More »

click to vote

IJCAI
2003

118views Artificial Intelligence» more IJCAI 2003»

Simultaneous Adversarial Multi-Robot Learning

15 years 1 months ago

Download www.cs.cmu.edu

Multi-robot learning faces all of the challenges of robot learning with all of the challenges of multiagent learning. There has been a great deal of recent research on multiagent ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

click to vote

ICML
2004
IEEE

142views Machine Learning» more ICML 2004»

Learning and discovery of predictive state representations in dynamical systems with reset

16 years 18 days ago

Download www.cc.gatech.edu

Predictive state representations (PSRs) are a recently proposed way of modeling controlled dynamical systems. PSR-based models use predictions of observable outcomes of tests that...

Michael R. James, Satinder P. Singh

claim paper

Read More »

101

click to vote

ISADS
1999
IEEE

81views Emerging Technology» more ISADS 1999»

Emergence of Communication for Negotiation by a Recurrent Neural Network

15 years 4 months ago

Download shws.cc.oita-u.ac.jp

We believe that communication in multi-agent system has two major meanings. One of them is to transmit one agent's observed information to the other. The other meaning is to ...

Katsunari Shibata, Koji Ito

claim paper

Read More »

« Prev « First page 49 / 69 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers