Search Sciweavers | Sciweavers

397 search results - page 34 / 80

» Reinforcement Learning with Hierarchies of Machines

138

click to vote

ICML
1995
IEEE

196views Machine Learning» more ICML 1995»

Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem

16 years 19 days ago

Download www.idsia.ch

In this paper we introduce Ant-Q, a family of algorithms which present many similarities with Q-learning (Watkins, 1989), and which we apply to the solution of symmetric and asymm...

Luca Maria Gambardella, Marco Dorigo

claim paper

Read More »

100

click to vote

EUROCAST
2007
Springer

182views Hardware» more EUROCAST 2007»

A k-NN Based Perception Scheme for Reinforcement Learning

15 years 6 months ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

claim paper

Read More »

click to vote

ALT
2006
Springer

111views Machine Learning» more ALT 2006»

Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence

15 years 8 months ago

Download www.idsia.ch

We address the problem of reinforcement learning in which observations may exhibit an arbitrary form of stochastic dependence on past observations and actions. The task for an age...

Daniil Ryabko, Marcus Hutter

claim paper

Read More »

click to vote

ECML
1997
Springer

79views Machine Learning» more ECML 1997»

Ibots Learn Genuine Team Solutions

15 years 4 months ago

Download www.idsia.ch

\Ibots" (Integrating roBOTS) is a computer experiment in group learning. It is designed to understand how to use reinforcement learning to program automatically a team of robo...

Cristina Versino, Luca Maria Gambardella

claim paper

Read More »

106

click to vote

ECML
2004
Springer

112views Machine Learning» more ECML 2004»

Convergence and Divergence in Standard and Averaging Reinforcement Learning

15 years 5 months ago

Download igitur-archive.library.uu.nl

Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...

Marco Wiering

claim paper

Read More »

« Prev « First page 34 / 80 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers