Search Sciweavers | Sciweavers

813 search results - page 86 / 163

» Ensemble Algorithms in Reinforcement Learning

126

click to vote

ECAI
2008
Springer

124views Artificial Intelligence» more ECAI 2008»

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 6 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo

claim paper

Read More »

142

click to vote

CSREAEEE
2008

199views Business» more CSREAEEE 2008»

Progranimate - A Web Enabled Algorithmic Problem Solving Application

15 years 5 months ago

Download www.comp.glam.ac.uk

- This paper proposes the use of an interactive web based problem solving application that utilises flowchart based programming and code generation to address the issues faced by n...

Andrew Scott, Mike Watkins, Duncan McPhee

claim paper

Read More »

132

click to vote

ICML
2004
IEEE

122views Machine Learning» more ICML 2004»

Ensembles of nested dichotomies for multi-class problems

15 years 10 months ago

Download www.cs.waikato.ac.nz

Nested dichotomies are a standard statistical technique for tackling certain polytomous classiﬁcation problems with logistic regression. They can be represented as binary trees ...

Eibe Frank, Stefan Kramer

claim paper

Read More »

124

click to vote

GECCO
2005
Springer

111views Optimization» more GECCO 2005»

XCS with eligibility traces

15 years 10 months ago

Download www.bcs.rochester.edu

The development of the XCS Learning Classiﬁer System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...

Jan Drugowitsch, Alwyn Barry

claim paper

Read More »

144

click to vote

SOCROB
2010

126views Robotics» more SOCROB 2010»

Using the Interaction Rhythm as a Natural Reinforcement Signal for Social Robots: A Matter of Belief

15 years 2 months ago

Download fostsvn.uopnet.plymouth.ac.uk

Abstract. In this paper, we present the results of a pilot study of a human robot interaction experiment where the rhythm of the interaction is used as a reinforcement signal to le...

Antoine Hiolle, Lola Cañamero, Pierre Andry...

claim paper

Read More »

« Prev « First page 86 / 163 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers