Search Sciweavers | Sciweavers

73

COLT
2008
Springer

132views Machine Learning» more COLT 2008»

Adaptive Aggregation for Reinforcement Learning with Efficient Exploration: Deterministic Domains

14 years 12 months ago

Download colt2008.cs.helsinki.fi

We propose a model-based learning algorithm, the Adaptive Aggregation Algorithm (AAA), that aims to solve the online, continuous state space reinforcement learning problem in a de...

Andrey Bernstein, Nahum Shimkin

claim paper

Read More »

79

click to vote

CG
2006
Springer

155views Computer Graphics» more CG 2006»

Feature Construction for Reinforcement Learning in Hearts

15 years 2 days ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

claim paper

Read More »

103

click to vote

NIPS
2001

131views Information Technology» more NIPS 2001»

The Steering Approach for Multi-Criteria Reinforcement Learning

14 years 11 months ago

Download books.nips.cc

We consider the problem of learning to attain multiple goals in a dynamic environment, which is initially unknown. In addition, the environment may contain arbitrarily varying ele...

Shie Mannor, Nahum Shimkin

claim paper

Read More »

86

click to vote

ATAL
2008
Springer

133views Intelligent Agents» more ATAL 2008»

Transfer of task representation in reinforcement learning using policy-based proto-value functions

15 years 3 days ago

Download www.aamas-conference.org

Reinforcement Learning research is traditionally devoted to solve single-task problems. Therefore, anytime a new task is faced, learning must be restarted from scratch. Recently, ...

Eliseo Ferrante, Alessandro Lazaric, Marcello Rest...

claim paper

Read More »

82

click to vote

TSMC
2008

229views more TSMC 2008»

A Comprehensive Survey of Multiagent Reinforcement Learning

14 years 10 months ago

Download www.dcsc.tudelft.nl

Multiagent systems are rapidly finding applications in a variety of domains, including robotics, distributed control, telecommunications, and economics. The complexity of many task...

Lucian Busoniu, Robert Babuska, Bart De Schutter

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers