Sciweavers

1799 search results - page 161 / 360
» Filtered Reinforcement Learning
Sort
View
PRICAI
2000
Springer
15 years 8 months ago
Constructing an Autonomous Agent with an Interdependent Heuristics
When we construct an agent by integrating modules, there appear troubles concerning the autonomy of the agent if we introduce a heuristics that dominates the whole agent. Thus, we ...
Koichi Moriyama, Masayuki Numao
AAAI
2010
15 years 6 months ago
Relative Entropy Policy Search
Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...
Jan Peters, Katharina Mülling, Yasemin Altun
163
Voted
ICAC
2009
IEEE
15 years 2 months ago
Using distributed w-learning for multi-policy optimization in decentralized autonomic systems
Distributed W-Learning (DWL) is a reinforcement learningbased algorithm for multi-policy optimization in agent-based systems. In this poster we propose the use of DWL for decentra...
Ivana Dusparic, Vinny Cahill
121
Voted
COGSR
2010
149views more  COGSR 2010»
14 years 11 months ago
Cognitive concepts in autonomous soccer playing robots
Computational concepts of cognition, their implementation in complex autonomous systems, and their empirical evaluation are key techniques to understand and validate concepts of c...
Martin Lauer, Roland Hafner, Sascha Lange, Martin ...
AAAI
2012
13 years 7 months ago
Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains
We present the first real-world benchmark for sequentiallyoptimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...
Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...