Sciweavers

1630 search results - page 28 / 326
» Coordinated Reinforcement Learning
Sort
View
121
Voted
ML
2008
ACM
15 years 5 months ago
Transfer in variable-reward hierarchical reinforcement learning
Neville Mehta, Sriraam Natarajan, Prasad Tadepalli...
112
Voted
ML
2000
ACM
133views Machine Learning» more  ML 2000»
15 years 5 months ago
Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms
Satinder P. Singh, Tommi Jaakkola, Michael L. Litt...
ICAART
2010
INSTICC
16 years 2 months ago
Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning
There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...
Christos Dimitrakakis
ATAL
2004
Springer
15 years 11 months ago
Time-Extended Policies in Multi-Agent Reinforcement Learning
Many algorithms such as Q-learning successfully address reinforcement learning in single-agent multi-time-step problems. In addition there are methods that address reinforcement l...
Kagan Tumer, Adrian K. Agogino
107
Voted
NIPS
2000
15 years 7 months ago
Balancing Multiple Sources of Reward in Reinforcement Learning
For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems i...
Christian R. Shelton