Search Sciweavers | Sciweavers

ICML
1996
IEEE

162 views, Machine Learning

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

16 years 1 month ago

Research in reinforcement learning (RL) has thus far concentrated on two optimality criteria: the discounted framework, which has been very well studied, and the average reward frame...

Sridhar Mahadevan

ICES
2003
Springer

125 views, Hardware

Evolving Reinforcement Learning-Like Abilities for Robots

15 years 6 months ago

Download lis.epfl.ch

Abstract. In [8] Yamauchi and Beer explored the abilities of continuous-time recurrent neural networks (CTRNNs) to display reinforcement-learning-like abilities. The investigated ta...

Jesper Blynel

AAAI
2008

105 views, Intelligent Agents

Potential-based Shaping in Model-based Reinforcement Learning

15 years 3 months ago

Download www.aaai.org

Potential-based shaping was designed as a way of introducing background knowledge into model-free reinforcement-learning algorithms. By identifying states that are likely to have ...

John Asmuth, Michael L. Littman, Robert Zinkov

NIPS
2007

149 views, Information Technology

Online Linear Regression and Its Application to Model-Based Reinforcement Learning

15 years 2 months ago

Download books.nips.cc

We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...

Alexander L. Strehl, Michael L. Littman

EUSFLAT
2001

144 views, Fuzzy Logic

Adaptive torque control using a connectionist reinforcement learning agent

15 years 2 months ago

Download www.eusflat.org

The correction of angular misalignment between mating components is a fundamental requirement for their successful assembly. In this paper we present how a learning agent based on...

Lorenzo Brignone, Martin Howarth, S. Sivayoganatha...
