Search Sciweavers | Sciweavers

417 search results - page 11 / 84

» Reinforcement Learning Estimation of Distribution Algorithm

ILP
2007
Springer

283 views, Automated Reasoning

Building Relational World Models for Reinforcement Learning

15 years 8 months ago

Download ftp.cs.wisc.edu

Abstract. Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...

Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...

CEC
2009
IEEE

133 views, Artificial Intelligence

Structure learning and optimisation in a Markov-network based estimation of distribution algorithm

15 years 9 months ago

Download www.comp.rgu.ac.uk

Structure learning is a crucial component of a multivariate Estimation of Distribution algorithm. It is the part which determines the interactions between variables in the proba...

Alexander E. I. Brownlee, John A. W. McCall, Siddh...

ICAC
2009
IEEE

226 views, Applied Computing

Using distributed w-learning for multi-policy optimization in decentralized autonomic systems

15 years 23 hours ago

Download www.scss.tcd.ie

Distributed W-Learning (DWL) is a reinforcement learning-based algorithm for multi-policy optimization in agent-based systems. In this poster we propose the use of DWL for decentra...

Ivana Dusparic, Vinny Cahill

IJRR
2008

186 views

Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning

15 years 2 months ago

Download groups.csail.mit.edu

Designing distributed controllers for self-reconfiguring modular robots has been consistently challenging. We have developed a reinforcement learning approach which can be used bo...

Paulina Varshavskaya, Leslie Pack Kaelbling, Danie...

CORR
2010
Springer

105 views, Education

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 26 days ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...
