Search Sciweavers | Sciweavers

4345 search results - page 154 / 869

» Relational Reinforcement Learning

118

Voted

ECAI
2008
Springer

124views Artificial Intelligence» more ECAI 2008»

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 5 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo

claim paper

Read More »

Voted

ICCBR
2005
Springer

91views Automated Reasoning» more ICCBR 2005»

Opportunities for CBR in Learning by Doing

15 years 9 months ago

Download gaia.fdi.ucm.es

In this paper we partially describe JV2 M, a metaphorical simulation of the Java Virtual Machine where students can learn Java language compilation and reinforce object-oriented pr...

Pedro Pablo Gómez-Martín, Marco Anto...

claim paper

Read More »

115

Voted

ECAL
2007
Springer

227views Artificial Intelligence» more ECAL 2007»

Guided Self-organisation for Autonomous Robot Development

15 years 9 months ago

Download robot.informatik.uni-leipzig.de

Abstract. The paper presents a method to guide the self-organised development of behaviours of autonomous robots. In earlier publications we demonstrated how to use the homeokinesi...

Georg Martius, J. Michael Herrmann, Ralf Der

claim paper

Read More »

136

click to vote

ICML
2005
IEEE

196views Machine Learning» more ICML 2005»

Bayesian sparse sampling for on-line reward optimization

16 years 4 months ago

Download www.cs.ualberta.ca

We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

136

Voted

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

15 years 1 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

« Prev « First page 154 / 869 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers