Search Sciweavers | Sciweavers

813 search results - page 135 / 163

» Ensemble Algorithms in Reinforcement Learning

145

click to vote

IROS
2008
IEEE

125views Robotics» more IROS 2008»

Dynamic correlation matrix based multi-Q learning for a multi-robot system

15 years 10 months ago

Download www.ece.stevens-tech.edu

—Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...

Hongliang Guo, Yan Meng

claim paper

Read More »

144

Voted

ECAI
2006
Springer

245views Artificial Intelligence» more ECAI 2006»

Least Squares SVM for Least Squares TD Learning

15 years 7 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

claim paper

Read More »

133

click to vote

ICRA
2010
IEEE

143views Robotics» more ICRA 2010»

Apprenticeship learning via soft local homomorphisms

15 years 2 months ago

Download damas.ift.ulaval.ca

Abstract— We consider the problem of apprenticeship learning when the expert’s demonstration covers only a small part of a large state space. Inverse Reinforcement Learning (IR...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

152

Voted

FLAIRS
2009

135views Artificial Intelligence» more FLAIRS 2009»

Beating the Defense: Using Plan Recognition to Inform Learning Agents

15 years 1 months ago

Download www.knexusresearch.com

In this paper, we investigate the hypothesis that plan recognition can significantly improve the performance of a casebased reinforcement learner in an adversarial action selectio...

Matthew Molineaux, David W. Aha, Gita Sukthankar

claim paper

Read More »

209

click to vote

CVPR
2011
IEEE

499views Computer Vision» more CVPR 2011»

Learning Context for Collective Activity Recognition

14 years 11 months ago

Download www.eecs.umich.edu

In this paper we present a framework for the recognition of collective human activities. A collective activity is deﬁned or reinforced by the existence of coherent behavior of i...

Wongun Choi, Silvio Savarese, Khuram Shahid

claim paper

Read More »

« Prev « First page 135 / 163 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers