Search Sciweavers | Sciweavers

582 search results - page 69 / 117

» Gaussian Processes in Reinforcement Learning

ICML
2006
IEEE

144views Machine Learning» more ICML 2006»

Probabilistic inference for solving discrete and continuous state Markov Decision Processes

16 years 19 days ago

Download eprints.pascal-network.org

Inference in Markov Decision Processes has recently received interest as a means to infer goals of an observed action, policy recognition, and also as a tool to compute policies. ...

Marc Toussaint, Amos J. Storkey

ICMLA
2010

161views Machine Learning» more ICMLA 2010»

Robust Learning for Adaptive Programs by Leveraging Program Structure

14 years 9 months ago

Download web.engr.oregonstate.edu

We study how to effectively integrate reinforcement learning (RL) and programming languages via adaptation-based programming, where programs can include non-deterministic...

Jervis Pinto, Alan Fern, Tim Bauer, Martin Erwig

CVPR
2007
IEEE

390views Computer Vision» more CVPR 2007»

What makes a good model of natural images?

16 years 1 month ago

Download www.cs.huji.ac.il

Many low-level vision algorithms assume a prior probability over images, and there has been great interest in trying to learn this prior from examples. Since images are very non G...

Yair Weiss, William T. Freeman

ICML
2003
IEEE

124views Machine Learning» more ICML 2003»

Exploration in Metric State Spaces

16 years 19 days ago

Download www.cis.upenn.edu

We present metric-E3, a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 6 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
