Search Sciweavers | Sciweavers

12

IROS
2008
IEEE

125views Robotics» more IROS 2008»

Dynamic correlation matrix based multi-Q learning for a multi-robot system

13 years 10 months ago

—Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...

Hongliang Guo, Yan Meng

claim paper

Read More »

13

click to vote

NIPS
2003

207views Information Technology» more NIPS 2003»

Extending Q-Learning to General Adaptive Multi-Agent Systems

13 years 5 months ago

Download books.nips.cc

Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This pap...

Gerald Tesauro

claim paper

Read More »

13

click to vote

ICML
1998
IEEE

268views Machine Learning» more ICML 1998»

The MAXQ Method for Hierarchical Reinforcement Learning

14 years 5 months ago

Download www.cs.ualberta.ca

This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...

Thomas G. Dietterich

claim paper

Read More »

8

click to vote

HCI
2009

192views Human Computer Interaction» more HCI 2009»

Development of Open Platform Based Adaptive HCI Concepts for Elderly Users

13 years 2 months ago

Download www.oasis-project.eu

This paper describes the framework and development process of adaptive user interfaces within the OASIS project. After presenting a rationale for user interface adaptation to addre...

Jan-Paul Leuteritz, Harald Widlroither, Alexandros...

claim paper

Read More »

15

click to vote

ICRA
2002
IEEE

133views Robotics» more ICRA 2002»

The Necessity of Average Rewards in Cooperative Multirobot Learning

13 years 9 months ago

Download www.ri.cmu.edu

Learning can be an effective way for robot systems to deal with dynamic environments and changing task conditions. However, popular singlerobot learning algorithms based on discou...

Poj Tangamchit, John M. Dolan, Pradeep K. Khosla

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers