Search Sciweavers | Sciweavers

141 search results - page 15 / 29

» Fuzzy Kanerva-based function approximation for reinforcement...

176

click to vote

NIPS
1994

90views Information Technology» more NIPS 1994»

Reinforcement Learning with Soft State Aggregation

15 years 8 months ago

Download www.eecs.umich.edu

It is widely accepted that the use of more compact representations than lookup tables is crucial to scaling reinforcement learning (RL) algorithms to real-world problems. Unfortun...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

235

click to vote

ICML
2003
IEEE

150views Machine Learning» more ICML 2003»

The Significance of Temporal-Difference Learning in Self-Play Training TD-Rummy versus EVO-rummy

16 years 22 days ago

Download www.hpl.hp.com

Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of ...

Clifford Kotnik, Jugal K. Kalita

claim paper

Read More »

212

click to vote

ATAL
2009
Springer

135views Intelligent Agents» more ATAL 2009»

An empirical analysis of value function-based and policy search reinforcement learning

16 years 2 months ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

236

click to vote

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

16 years 29 days ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

claim paper

Read More »

182

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

« Prev « First page 15 / 29 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers