Search Sciweavers | Sciweavers

463 search results - page 11 / 93

» Localizing Search in Reinforcement Learning

161

click to vote

ECAI
2008
Springer

124views Artificial Intelligence» more ECAI 2008»

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 8 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo

claim paper

Read More »

189

click to vote

CG
2006
Springer

155views Computer Graphics» more CG 2006»

Feature Construction for Reinforcement Learning in Hearts

15 years 9 months ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

claim paper

Read More »

289

click to vote

Book

392views

Reinforcement Learning: An Introduction

17 years 5 months ago

Download www.cs.ualberta.ca

"Reinforcement learning is learning what to do how to map situations to actions so as to maximize a numerical reward signal. The learner is not told which actions to take, as ...

Richard S. Sutton, Andrew G. Barto

posted by scimaster

Read More »

217

click to vote

IJCAI
2001

84views Artificial Intelligence» more IJCAI 2001»

Reinforcement Learning in Distributed Domains: Beyond Team Games

15 years 8 months ago

Download web.engr.oregonstate.edu

Using a distributed algorithm rather than a centralized one can be extremely beneficial in large search problems. In addition, the incorporation of machine learning techniques lik...

David Wolpert, Joseph Sill, Kagan Tumer

claim paper

Read More »

184

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Q-Decomposition for Reinforcement Learning Agents

16 years 7 months ago

Download www.hpl.hp.com

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...

Stuart J. Russell, Andrew Zimdars

claim paper

Read More »

« Prev « First page 11 / 93 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers