Search Sciweavers | Sciweavers

1236 search results - page 62 / 248

» Opposition-Based Reinforcement Learning

113

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 8 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

134

click to vote

PKDD
2009
Springer

129views Data Mining» more PKDD 2009»

Considering Unseen States as Impossible in Factored Reinforcement Learning

15 years 10 months ago

Download www-desir.lip6.fr

Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...

claim paper

Read More »

139

click to vote

IAT
2007
IEEE

92views Intelligent Agents» more IAT 2007»

Noise Tolerance in Reinforcement Learning Algorithms

15 years 10 months ago

Download www.ppgia.pucpr.br

This paper proposes a mechanism of noise tolerance for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning algorithms may receive and accumula...

Richardson Ribeiro, Alessandro L. Koerich, Fabr&ia...

claim paper

Read More »

127

click to vote

ROBOCUP
2007
Springer

102views Robotics» more ROBOCUP 2007»

Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents

15 years 10 months ago

Download www.fei.edu.br

This paper describes the design and implementation of robotic agents for the RoboCup Simulation 2D category that learns using a recently proposed Heuristic Reinforcement Learning a...

Luiz A. Celiberto, Carlos H. C. Ribeiro, Anna Hele...

claim paper

Read More »

110

click to vote

SBIA
2004
Springer

137views Artificial Intelligence» more SBIA 2004»

Heuristically Accelerated Q-Learning: A New Approach to Speed Up Reinforcement Learning

15 years 9 months ago

Download www.fei.edu.br

This work presents a new algorithm, called Heuristically Accelerated Q–Learning (HAQL), that allows the use of heuristics to speed up the well-known Reinforcement Learning algori...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...

claim paper

Read More »

« Prev « First page 62 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers