Search Sciweavers | Sciweavers

4345 search results - page 56 / 869

» Relational Reinforcement Learning

Voted

TSMC
2008

76views more TSMC 2008»

Improved Adaptive-Reinforcement Learning Control for Morphing Unmanned Air Vehicles

15 years 3 months ago

Download jungfrau.tamu.edu

This paper presents an improved Adaptive

John Valasek, James Doebbler, Monish D. Tandale, A...

claim paper

Read More »

159

click to vote

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

15 years 9 months ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

claim paper

Read More »

123

Voted

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

16 years 4 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

111

click to vote

AAAI
1997

107views Intelligent Agents» more AAAI 1997»

Reinforcement Learning with Time

15 years 4 months ago

Download www.aaai.org

This paper steps back from the standard infinite horizon formulation of reinforcement learning problems to consider the simpler case of finite horizon problems. Although finite ho...

Daishi Harada

claim paper

Read More »

113

Voted

COLT
2008
Springer

132views Machine Learning» more COLT 2008»

Adaptive Aggregation for Reinforcement Learning with Efficient Exploration: Deterministic Domains

15 years 5 months ago

Download colt2008.cs.helsinki.fi

We propose a model-based learning algorithm, the Adaptive Aggregation Algorithm (AAA), that aims to solve the online, continuous state space reinforcement learning problem in a de...

Andrey Bernstein, Nahum Shimkin

claim paper

Read More »

« Prev « First page 56 / 869 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers