Search Sciweavers | Sciweavers

29 search results - page 4 / 6

» Reinforcement Learning for Mapping Instructions to Actions

JMLR
2012

200 views, Programming Languages

Contextual Bandit Learning with Predictable Rewards

11 years 8 months ago

Download www.cs.princeton.edu

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...

IUI
2005
ACM

108 views, Software Engineering

Task learning by instruction in Tailor

13 years 11 months ago

Download ai.isi.edu

In order for intelligent systems to be applicable in a wide range of situations, end users must be able to modify their task descriptions. We introduce Tailor, a system that allow...

Jim Blythe

KBS
2006

105 views

Robot docking based on omnidirectional vision and reinforcement learning

13 years 5 months ago

Download www.eecs.wsu.edu

We present a system for visual robotic docking using an omnidirectional camera coupled with the actor critic reinforcement learning algorithm. The system enables a PeopleBot robot...

David Muse, Cornelius Weber, Stefan Wermter

AAAI
2000

139 views, Intelligent Agents

Localizing Search in Reinforcement Learning

13 years 7 months ago

Download www.cs.colorado.edu

Reinforcement learning (RL) can be impractical for many high dimensional problems because of the computational cost of doing stochastic search in large state spaces. We propose a ...

Gregory Z. Grudic, Lyle H. Ungar

RAS
2000

161 views

Active object recognition by view integration and reinforcement learning

13 years 5 months ago

Download www.emt.tu-graz.ac.at

A mobile agent with the task to classify its sensor pattern has to cope with ambiguous information. Active recognition of three-dimensional objects involves the observer in a sear...

Lucas Paletta, Axel Pinz
