Search Sciweavers | Sciweavers

1235 search results - page 38 / 247

» Reinforcement learning in a nutshell

155

Voted

CIG
2005
IEEE

120views Applied Computing» more CIG 2005»

Adapting Reinforcement Learning for Computer Games: Using Group Utility Functions

15 years 11 months ago

Download cswww.essex.ac.uk

AbstractGroup utility functions are an extension of the common team utility function for providing multiple agents with a common reinforcement learning signal for learning cooperat...

Jay Bradley, Gillian Hayes

claim paper

Read More »

click to vote

TSMC
2008

76views more TSMC 2008»

Improved Adaptive-Reinforcement Learning Control for Morphing Unmanned Air Vehicles

15 years 5 months ago

Download jungfrau.tamu.edu

This paper presents an improved Adaptive

John Valasek, James Doebbler, Monish D. Tandale, A...

claim paper

Read More »

181

click to vote

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

15 years 11 months ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

claim paper

Read More »

139

Voted

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

16 years 6 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

169

Voted

AGENTS
2001
Springer

201views Security Privacy» more AGENTS 2001»

Using background knowledge to speed reinforcement learning in physical agents

15 years 10 months ago

Download www.isle.org

This paper describes Icarus, an agent architecture that embeds a hierarchical reinforcement learning algorithm within a language for specifying agent behavior. An Icarus program e...

Daniel G. Shapiro, Pat Langley, Ross D. Shachter

claim paper

Read More »

« Prev « First page 38 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers