Sciweavers

29 search results - page 4 / 6
» Reinforcement Learning for Mapping Instructions to Actions
Sort
View
JMLR
2012
11 years 8 months ago
Contextual Bandit Learning with Predictable Rewards
Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...
Alekh Agarwal, Miroslav Dudík, Satyen Kale,...
IUI
2005
ACM
13 years 11 months ago
Task learning by instruction in tailor
In order for intelligent systems to be applicable in a wide range of situations, end users must be able to modify their task descriptions. We introduce Tailor, a system that allow...
Jim Blythe
KBS
2006
105views more  KBS 2006»
13 years 5 months ago
Robot docking based on omnidirectional vision and reinforcement learning
We present a system for visual robotic docking using an omnidirectional camera coupled with the actor critic reinforcement learning algorithm. The system enables a PeopleBot robot...
David Muse, Cornelius Weber, Stefan Wermter
AAAI
2000
13 years 7 months ago
Localizing Search in Reinforcement Learning
Reinforcement learning (RL) can be impractical for many high dimensional problems because of the computational cost of doing stochastic search in large state spaces. We propose a ...
Gregory Z. Grudic, Lyle H. Ungar
RAS
2000
161views more  RAS 2000»
13 years 5 months ago
Active object recognition by view integration and reinforcement learning
A mobile agent with the task to classify its sensor pattern has to cope with ambiguous information. Active recognition of three-dimensional objects involves the observer in a sear...
Lucas Paletta, Axel Pinz