Search Sciweavers | Sciweavers

27 search results - page 4 / 6

» Learning nonparametric policies by imitation

click to vote

ATAL
2007
Springer

180views Intelligent Agents» more ATAL 2007»

Confidence-based policy learning from demonstration using Gaussian mixture models

13 years 9 months ago

Download www.cs.cmu.edu

We contribute an approach for interactive policy learning through expert demonstration that allows an agent to actively request and effectively represent demonstration examples. I...

Sonia Chernova, Manuela M. Veloso

claim paper

Read More »

click to vote

NIPS
2008

165views Information Technology» more NIPS 2008»

Regularized Policy Iteration

13 years 6 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

click to vote

ICML
2007
IEEE

180views Machine Learning» more ICML 2007»

Bayesian actor-critic algorithms

14 years 6 months ago

Download www.machinelearning.org

We1 present a new actor-critic learning model in which a Bayesian class of non-parametric critics, using Gaussian process temporal difference learning is used. Such critics model ...

Mohammad Ghavamzadeh, Yaakov Engel

claim paper

Read More »

click to vote

COLT
2010
Springer

129views Machine Learning» more COLT 2010»

Nonparametric Bandits with Covariates

13 years 2 months ago

Download www.princeton.edu

We consider a bandit problem which involves sequential sampling from two populations (arms). Each arm produces a noisy reward realization which depends on an observable random cov...

Philippe Rigollet, Assaf Zeevi

claim paper

Read More »

click to vote

IJCNN
2006
IEEE

127views Neural Networks» more IJCNN 2006»

Reinforcement Learning for Parameterized Motor Primitives

13 years 11 months ago

Download www-clmc.usc.edu

Abstract— One of the major challenges in both action generation for robotics and in the understanding of human motor control is to learn the “building blocks of movement genera...

Jan Peters, Stefan Schaal

claim paper

Read More »

« Prev « First page 4 / 6 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers