Sciweavers

4446 search results - page 45 / 890
» Learning Observer Agents
Sort
View
ATAL
2010
Springer
14 years 4 months ago
Self-organisation in an agent network via learning
Dayong Ye, Minjie Zhang, Danny Sutanto
IJCAI
2001
14 years 11 months ago
R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning
R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...
Ronen I. Brafman, Moshe Tennenholtz