Search Sciweavers | Sciweavers

210 search results - page 29 / 42

» An analysis of reinforcement learning with function approxim...

172

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

15 years 6 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

139

click to vote

EMMCVPR
2005
Springer

143views Computer Vision» more EMMCVPR 2005»

Exploiting Inference for Approximate Parameter Learning in Discriminative Fields: An Empirical Study

15 years 10 months ago

Download www.cs.cmu.edu

Abstract. Estimation of parameters of random ﬁeld models from labeled training data is crucial for their good performance in many image analysis applications. In this paper, we p...

Sanjiv Kumar, Jonas August, Martial Hebert

claim paper

Read More »

142

click to vote

JMLR
2010

136views more JMLR 2010»

Approximate Riemannian Conjugate Gradient Learning for Fixed-Form Variational Bayes

15 years 8 hour ago

Download jmlr.csail.mit.edu

Variational Bayesian (VB) methods are typically only applied to models in the conjugate-exponential family using the variational Bayesian expectation maximisation (VB EM) algorith...

Antti Honkela, Tapani Raiko, Mikael Kuusela, Matti...

claim paper

Read More »

154

click to vote

CVPR
2011
IEEE

314views Computer Vision» more CVPR 2011»

Nonparametric Density Estimation on A Graph: Learning Framework, Fast Approximation and Application in Image Segmentation

15 years 1 months ago

Download ihome.ust.hk

We present a novel framework for tree-structure embedded density estimation and its fast approximation for mode seeking. The proposed method could ﬁnd diverse applications in co...

Zhiding Yu, Oscar Au, Ketan Tang

claim paper

Read More »

140

click to vote

ICML
1994
IEEE

152views Machine Learning» more ICML 1994»

A Modular Q-Learning Architecture for Manipulator Task Decomposition

15 years 8 months ago

Download mi.eng.cam.ac.uk

Compositional Q-Learning (CQ-L) (Singh 1992) is a modular approach to learning to performcomposite tasks made up of several elemental tasks by reinforcement learning. Skills acqui...

Chen K. Tham, Richard W. Prager

claim paper

Read More »

« Prev « First page 29 / 42 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers