Sciweavers

1233 search results - page 102 / 247
» Reinforcement Learning in MirrorBot
ICML
2000
IEEE
16 years 4 months ago
Algorithm Selection using Reinforcement Learning
Michail G. Lagoudakis, Michael L. Littman
ICML
2000
IEEE
16 years 4 months ago
Practical Reinforcement Learning in Continuous Spaces
William D. Smart, Leslie Pack Kaelbling
ICML
1997
IEEE
16 years 4 months ago
Expected Mistake Bound Model for On-Line Reinforcement Learning
Claude-Nicolas Fiechter
ICAART
2010
INSTICC
16 years 23 days ago
Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning
There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...
Christos Dimitrakakis
ATAL
2004
Springer
15 years 9 months ago
Time-Extended Policies in Multi-Agent Reinforcement Learning
Many algorithms such as Q-learning successfully address reinforcement learning in single-agent multi-time-step problems. In addition there are methods that address reinforcement l...
Kagan Tumer, Adrian K. Agogino