Search Sciweavers | Sciweavers

18 search results - page 3 / 4

» Incremental Least Squares Policy Iteration for POMDPs

ICML
2009
IEEE

194 views, Machine Learning

Binary action search for learning continuous-action control policies

14 years 6 months ago

Download www.intelligence.tuc.gr

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis

ATAL
2009
Springer

146 views, Intelligent Agents

Online exploration in least-squares policy iteration

13 years 12 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

AAAI
2010

185 views, Intelligent Agents

PUMA: Planning Under Uncertainty with Macro-Actions

13 years 6 months ago

Download www.cs.berkeley.edu

Planning in large, partially observable domains is challenging, especially when a long-horizon lookahead is necessary to obtain a good policy. Traditional POMDP planners that plan...

Ruijie He, Emma Brunskill, Nicholas Roy

ICMLA
2008

195 views, Machine Learning

Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture

13 years 6 months ago

Download www.grappa.univ-lille3.fr

In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...

Sertan Girgin, Philippe Preux

ICASSP
2011
IEEE

183 views, Signal Processing

Adaptive modelling with tunable RBF network using multi-innovation RLS algorithm assisted by swarm intelligence

12 years 9 months ago

Download mirlab.org

In this paper, we propose a new on-line learning algorithm for non-linear system identification: the swarm intelligence aided multi-innovation recursive least squares (SIM...

Hao Chen, Yu Gong, Xia Hong
