Search Sciweavers | Sciweavers

250 search results - page 35 / 50

» Learning action effects in partially observable domains

ICML
2008
IEEE

Machine Learning

Exploration scavenging

16 years 16 days ago

Download hunch.net

We examine the problem of evaluating a policy in the contextual bandit setting using only observations collected during the execution of another policy. We show that policy evalua...

John Langford, Alexander L. Strehl, Jennifer Wortm...

ICML
2006
IEEE

Machine Learning

An analytic solution to discrete Bayesian reinforcement learning

16 years 16 days ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

AR
2007

Reinforcement learning of a continuous motor sequence with hidden states

14 years 12 months ago

Download www.bdc.brain.riken.go.jp

Reinforcement learning is the scheme for unsupervised learning in which robots are expected to acquire behavior skills through self-explorations based on reward signals. There a...

Hiroaki Arie, Tetsuya Ogata, Jun Tani, Shigeki Sug...

IJRR
2008

Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning

14 years 11 months ago

Download groups.csail.mit.edu

Designing distributed controllers for self-reconfiguring modular robots has been consistently challenging. We have developed a reinforcement learning approach which can be used bo...

Paulina Varshavskaya, Leslie Pack Kaelbling, Danie...

ICML
2006
IEEE

Machine Learning

Relational temporal difference learning

16 years 16 days ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley
