Search Sciweavers | Sciweavers

509 search results - page 1 / 102

» Compositional Models for Reinforcement Learning

137

click to vote

IJCAI
2003

130views Artificial Intelligence» more IJCAI 2003»

Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

15 years 6 months ago

Download www.cc.gatech.edu

We present a new algorithm, GM-Sarsa(0), for ﬁnding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processe...

Nathan Sprague, Dana H. Ballard

claim paper

Read More »

142

click to vote

PKDD
2009
Springer

144views Data Mining» more PKDD 2009»

Compositional Models for Reinforcement Learning

15 years 11 months ago

Download userweb.cs.utexas.edu

Abstract. Innovations such as optimistic exploration, function approximation, and hierarchical decomposition have helped scale reinforcement learning to more complex environments, ...

Nicholas K. Jong, Peter Stone

claim paper

Read More »

180

click to vote

ICML
1998
IEEE

202views Machine Learning» more ICML 1998»

Learning to Drive a Bicycle Using Reinforcement Learning and Shaping

15 years 9 months ago

Download www.cs.mcgill.ca

We present and solve a real-world problem of learning to drive a bicycle. We solve the problem by online reinforcement learning using the Sarsa( )-algorithm. Then we solve the ...

Jette Randløv, Preben Alstrøm

claim paper

Read More »

178

click to vote

SAC
2005
ACM

149views Applied Computing» more SAC 2005»

Reinforcement learning agents with primary knowledge designed by analytic hierarchy process

15 years 10 months ago

Download k2x.ice.ous.ac.jp

This paper presents a novel model of reinforcement learning agents. A feature of our learning agent model is to integrate analytic hierarchy process (AHP) into a standard reinforc...

Kengo Katayama, Takahiro Koshiishi, Hiroyuki Narih...

claim paper

Read More »

188

click to vote

SCAI
2008

246views Artificial Intelligence» more SCAI 2008»

Fast Learning in an Actor-Critic Architecture with Reward and Punishment

15 years 6 months ago

Download www.lucs.lu.se

Abstract. A reinforcement architecture is introduced that consists of three complementary learning systems with different generalization abilities. The ACTOR learns state-action as...

Christian Balkenius, Stefan Winberg

claim paper

Read More »

« Prev « First page 1 / 102 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers