Search Sciweavers | Sciweavers

ICML
2000
IEEE

Reinforcement Learning in POMDP's via Direct Gradient Ascent

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled POMDPs. We introduce GPOMDP, a...

Jonathan Baxter, Peter L. Bartlett

ABIALS
2008
Springer

Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning

To establish autonomous behavior for technical systems, the well-known trade-off between reactive control and deliberative planning has to be considered. Within ...

Matthias Rungger, Hao Ding, Olaf Stursberg

IJON
2006

Reinforcement learning of a simple control task using the spike response model

In this work, we propose a variation of a direct reinforcement learning algorithm, suitable for usage with spiking neurons based on the spike response model (SRM). The SRM is a bi...

Murilo Saraiva de Queiroz, Roberto Coelho de Berr...

ICML
1998
IEEE

A Randomized ANOVA Procedure for Comparing Performance Curves

Three factors are related in analyses of performance curves such as learning curves: the amount of training, the learning algorithm, and performance. Often we want to know whether...

Justus H. Piater, Paul R. Cohen, Xiaoqin Zhang, Mi...

GECCO
2005
Springer

Co-evolving recurrent neurons learn deep memory POMDPs

Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient descent is too slow and unstable for practical use i...

Faustino J. Gomez, Jürgen Schmidhuber
