Search Sciweavers | Sciweavers

65

ICML
2008
IEEE

133views Machine Learning» more ICML 2008»

An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning

16 years 21 days ago

We show that linear value-function approximation is equivalent to a form of linear model approximation. We then derive a relationship between the model-approximation error and the...

Ronald Parr, Lihong Li, Gavin Taylor, Christopher ...

claim paper

Read More »

102

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

16 years 21 days ago

Download reference.kfupm.edu.sa

Research in reinforcementlearning (RL)has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the averagereward frame...

Sridhar Mahadevan

claim paper

Read More »

88

click to vote

ICES
2003
Springer

125views Hardware» more ICES 2003»

Evolving Reinforcement Learning-Like Abilities for Robots

15 years 5 months ago

Download lis.epfl.ch

Abstract. In [8] Yamauchi and Beer explored the abilities of continuous time recurrent neural networks (CTRNNs) to display reinforcementlearning like abilities. The investigated ta...

Jesper Blynel

claim paper

Read More »

77

click to vote

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

16 years 21 days ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

86

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 21 days ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers