Search Sciweavers | Sciweavers

651 search results - page 106 / 131

» Algorithms for Inverse Reinforcement Learning

IROS
2008
IEEE

125views Robotics» more IROS 2008»

Dynamic correlation matrix based multi-Q learning for a multi-robot system

15 years 6 months ago

Download www.ece.stevens-tech.edu

Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...

Hongliang Guo, Yan Meng

ECML
2003
Springer

87views Machine Learning» more ECML 2003»

Self-evaluated Learning Agent in Multiple State Games

15 years 5 months ago

Download www.ai.sanken.osaka-u.ac.jp

Abstract. Most of multi-agent reinforcement learning algorithms aim to converge to a Nash equilibrium, but a Nash equilibrium does not necessarily mean a desirable result. On the o...

Koichi Moriyama, Masayuki Numao

ECAI
2006
Springer

245views Artificial Intelligence» more ECAI 2006»

Least Squares SVM for Least Squares TD Learning

15 years 3 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

ATAL
2008
Springer

145views Intelligent Agents» more ATAL 2008»

Artificial agents learning human fairness

15 years 1 months ago

Download www.sce.carleton.ca

Recent advances in technology allow multi-agent systems to be deployed in cooperation with or as a service for humans. Typically, those systems are designed assuming individually ...

Steven de Jong, Karl Tuyls, Katja Verbeeck

FLAIRS
2009

135views Artificial Intelligence» more FLAIRS 2009»

Beating the Defense: Using Plan Recognition to Inform Learning Agents

14 years 9 months ago

Download www.knexusresearch.com

In this paper, we investigate the hypothesis that plan recognition can significantly improve the performance of a case-based reinforcement learner in an adversarial action selectio...

Matthew Molineaux, David W. Aha, Gita Sukthankar
