Search Sciweavers | Sciweavers

1799 search results - page 161 / 360

» Filtered Reinforcement Learning

122

click to vote

PRICAI
2000
Springer

127views Artificial Intelligence» more PRICAI 2000»

Constructing an Autonomous Agent with an Interdependent Heuristics

15 years 8 months ago

Download www.ai.sanken.osaka-u.ac.jp

When we construct an agent by integrating modules, there appear troubles concerning the autonomy of the agent if we introduce a heuristics that dominates the whole agent. Thus, we ...

Koichi Moriyama, Masayuki Numao

claim paper

Read More »

141

click to vote

AAAI
2010

191views Intelligent Agents» more AAAI 2010»

Relative Entropy Policy Search

15 years 6 months ago

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...

Jan Peters, Katharina Mülling, Yasemin Altun

claim paper

Read More »

163

Voted

ICAC
2009
IEEE

226views Applied Computing» more ICAC 2009»

Using distributed w-learning for multi-policy optimization in decentralized autonomic systems

15 years 2 months ago

Download www.scss.tcd.ie

Distributed W-Learning (DWL) is a reinforcement learningbased algorithm for multi-policy optimization in agent-based systems. In this poster we propose the use of DWL for decentra...

Ivana Dusparic, Vinny Cahill

claim paper

Read More »

121

Voted

COGSR
2010

149views more COGSR 2010»

Cognitive concepts in autonomous soccer playing robots

14 years 11 months ago

Download ml.informatik.uni-freiburg.de

Computational concepts of cognition, their implementation in complex autonomous systems, and their empirical evaluation are key techniques to understand and validate concepts of c...

Martin Lauer, Roland Hafner, Sascha Lange, Martin ...

claim paper

Read More »

182

click to vote

AAAI
2012

205views Intelligent Agents» more AAAI 2012»

Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains

13 years 7 months ago

Download www.intelligence.tuc.gr

We present the ﬁrst real-world benchmark for sequentiallyoptimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...

Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...

claim paper

Read More »

« Prev « First page 161 / 360 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers