Search Sciweavers | Sciweavers

575 search results - page 30 / 115

» Reinforcement Learning State Estimator

120

click to vote

AAAI
1997

107views Intelligent Agents» more AAAI 1997»

Reinforcement Learning with Time

15 years 5 months ago

Download www.aaai.org

This paper steps back from the standard infinite horizon formulation of reinforcement learning problems to consider the simpler case of finite horizon problems. Although finite ho...

Daishi Harada

claim paper

Read More »

128

click to vote

AAAI
2006

108views Intelligent Agents» more AAAI 2006»

Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains

15 years 5 months ago

Download www.eecs.umich.edu

We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...

Vishal Soni, Satinder P. Singh

claim paper

Read More »

137

click to vote

ATAL
2009
Springer

137views Intelligent Agents» more ATAL 2009»

Generalized model learning for reinforcement learning in factored domains

15 years 11 months ago

Download userweb.cs.utexas.edu

Improving the sample eﬃciency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-ba...

Todd Hester, Peter Stone

claim paper

Read More »

141

click to vote

ICMLA
2010

205views Machine Learning» more ICMLA 2010»

Incremental Learning of Relational Action Rules

15 years 1 months ago

Download www-lipn.univ-paris13.fr

Abstract--In the Relational Reinforcement learning framework, we propose an algorithm that learns an action model allowing to predict the resulting state of each action in any give...

Christophe Rodrigues, Pierre Gérard, C&eacu...

claim paper

Read More »

138

Voted

CG
2006
Springer

155views Computer Graphics» more CG 2006»

Feature Construction for Reinforcement Learning in Hearts

15 years 6 months ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

claim paper

Read More »

« Prev « First page 30 / 115 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers