Sciweavers

7280 search results - page 1 / 1456
» Adaptive bases for Q-learning
Sort
View
IROS
2008
IEEE
125views Robotics» more  IROS 2008»
13 years 10 months ago
Dynamic correlation matrix based multi-Q learning for a multi-robot system
—Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...
Hongliang Guo, Yan Meng
NIPS
2003
13 years 5 months ago
Extending Q-Learning to General Adaptive Multi-Agent Systems
Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This pap...
Gerald Tesauro
ICML
1998
IEEE
14 years 5 months ago
The MAXQ Method for Hierarchical Reinforcement Learning
This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...
Thomas G. Dietterich
HCI
2009
13 years 2 months ago
Development of Open Platform Based Adaptive HCI Concepts for Elderly Users
This paper describes the framework and development process of adaptive user interfaces within the OASIS project. After presenting a rationale for user interface adaptation to addre...
Jan-Paul Leuteritz, Harald Widlroither, Alexandros...
ICRA
2002
IEEE
133views Robotics» more  ICRA 2002»
13 years 9 months ago
The Necessity of Average Rewards in Cooperative Multirobot Learning
Learning can be an effective way for robot systems to deal with dynamic environments and changing task conditions. However, popular singlerobot learning algorithms based on discou...
Poj Tangamchit, John M. Dolan, Pradeep K. Khosla