Sciweavers

29 search results - page 5 / 6
» Reinforcement Learning for Mapping Instructions to Actions
Sort
View
ROMAN
2007
IEEE
150views Robotics» more  ROMAN 2007»
13 years 11 months ago
Asymmetric Interpretations of Positive and Negative Human Feedback for a Social Learning Agent
— The ability for people to interact with robots and teach them new skills will be crucial to the successful application of robots in everyday human environments. In order to des...
Andrea Lockerd Thomaz, Cynthia Breazeal
ECML
2007
Springer
13 years 11 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
NECO
2007
150views more  NECO 2007»
13 years 4 months ago
Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule
Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...
Dorit Baras, Ron Meir
ECML
2006
Springer
13 years 9 months ago
Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks
Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...
Sébastien Jodogne, Cyril Briquet, Justus H....
MICAI
2010
Springer
13 years 3 months ago
Teaching a Robot to Perform Tasks with Voice Commands
The full deployment of service robots in daily activities will require the robot to adapt to the needs of non-expert users, particularly, to learn how to perform new tasks from “...
Ana C. Tenorio-Gonzalez, Eduardo F. Morales, Luis ...