Sciweavers

4446 search results - page 45 / 890
» Learning Observer Agents
AAMAS
2002
Springer
15 years 3 months ago
Using Landscape Theory to Measure Learning Difficulty for Adaptive Agents
Christopher H. Brooks, Edmund H. Durfee
ATAL
2010
Springer
14 years 10 months ago
Self-organisation in an agent network via learning
Dayong Ye, Minjie Zhang, Danny Sutanto
IJCAI
2001
15 years 4 months ago
R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning
R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...
Ronen I. Brafman, Moshe Tennenholtz