Search Sciweavers | Sciweavers

1235 search results - page 139 / 247

» ABC Reinforcement Learning

168

click to vote

ECML
2006
Springer

146views Machine Learning» more ECML 2006»

Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

15 years 6 months ago

Download www.montefiore.ulg.ac.be

We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJ...

Sébastien Jodogne, Justus H. Piater

claim paper

Read More »

129

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 4 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

16 years 4 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

103

click to vote

ICCBR
2009
Springer

159views Automated Reasoning» more ICCBR 2009»

Case-Based Reasoning in Transfer Learning

15 years 9 months ago

Download www.knexusresearch.com

Positive transfer learning (TL) occurs when, after gaining experience from learning how to solve a (source) task, the same learner can exploit this experience to improve performanc...

David W. Aha, Matthew Molineaux, Gita Sukthankar

claim paper

Read More »

107

click to vote

IJCNN
2006
IEEE

121views Neural Networks» more IJCNN 2006»

Learning a Rendezvous Task with Dynamic Joint Action Perception

15 years 9 months ago

Download axon.cs.byu.edu

Abstract— Groups of reinforcement learning agents interacting in a common environment often fail to learn optimal behaviors. Poor performance is particularly common in environmen...

Nancy Fulda, Dan Ventura

claim paper

Read More »

« Prev « First page 139 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers