Search Sciweavers | Sciweavers

575 search results - page 18 / 115

» Reinforcement Learning State Estimator

ATAL
2011
Springer

Intelligent Agents

Metric learning for reinforcement learning agents

14 years 4 months ago

Download www.eecs.berkeley.edu

A key component of any reinforcement learning algorithm is the underlying representation used by the agent. While reinforcement learning (RL) agents have typically relied on hand-...

Matthew E. Taylor, Brian Kulis, Fei Sha

AAAI
1993

Intelligent Agents

Complexity Analysis of Real-Time Reinforcement Learning

15 years 5 months ago

Download www.ri.cmu.edu

This paper analyzes the complexity of on-line reinforcement learning algorithms, namely asynchronous realtime versions of Q-learning and value-iteration, applied to the problem of...

Sven Koenig, Reid G. Simmons

COLT
2000
Springer

Machine Learning

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 9 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

ICML
2004
IEEE

Machine Learning

Bellman goes relational

16 years 5 months ago

Download people.csail.mit.edu

Motivated by the interest in relational reinforcement learning, we introduce a novel relational Bellman update operator called ReBel. It employs a constraint logic programming lan...

Kristian Kersting, Martijn Van Otterlo, Luc De Rae...

ICML
2010
IEEE

Machine Learning

Efficient Reinforcement Learning with Multiple Reward Functions for Randomized Controlled Trial Analysis

15 years 5 months ago

Download www.stat.lsa.umich.edu

We introduce new, efficient algorithms for value iteration with multiple reward functions and continuous state. We also give an algorithm for finding the set of all nondominated a...

Daniel J. Lizotte, Michael H. Bowling, Susan A. Mu...
