Search Sciweavers | Sciweavers

1233 search results - page 89 / 247

» Reinforcement Learning in MirrorBot

132

Voted

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

15 years 5 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

144

Voted

HIS
2008

122views Information Technology» more HIS 2008»

New Crossover Operator for Evolutionary Rule Discovery in XCS

15 years 5 months ago

Download www.salle.url.edu

XCS is a learning classifier system that combines a reinforcement learning scheme with evolutionary algorithms to evolve rule sets on-line by means of the interaction with an envi...

Sergio Morales-Ortigosa, Albert Orriols-Puig, Este...

claim paper

Read More »

178

Voted

AIIDE
2009

263views Artificial Intelligence» more AIIDE 2009»

Examining Extended Dynamic Scripting in a Tactical Game Framework

15 years 4 months ago

Download www.stottlerhenke.com

Dynamic scripting is a reinforcement learning algorithm designed specifically to learn appropriate tactics for an agent in a modern computer game, such as Neverwinter Nights. This...

Jeremy Ludwig, Arthur Farley

claim paper

Read More »

113

Voted

DAGSTUHL
2003

116views Software Engineering» more DAGSTUHL 2003»

Maximizing Learning Progress: An Internal Reward System for Development

15 years 5 months ago

Download www.csl.sony.fr

This chapter presents a generic internal reward system that drives an agent to increase the complexity of its behavior. This reward system does not reinforce a predeﬁned task. It...

Frédéric Kaplan, Pierre-Yves Oudeyer

claim paper

Read More »

213

Voted

Publication

233views

Sparse reward processes

14 years 2 months ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

Read More »

« Prev « First page 89 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers