Search Sciweavers | Sciweavers

539 search results - page 54 / 108

» Learning Monotonic Linear Functions


ECML
2005
Springer

193 views, Machine Learning

Natural Actor-Critic

15 years 11 months ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal


CORR
2008
Springer

99 views, Education

When is there a representer theorem? Vector versus matrix regularizers

15 years 6 months ago

Download jmlr.csail.mit.edu

We consider a general class of regularization methods which learn a vector of parameters on the basis of linear measurements. It is well known that if the regularizer is a nondecr...

Andreas Argyriou, Charles A. Micchelli, Massimilia...


JMLR
2008

150 views

Discriminative Learning of Max-Sum Classifiers

15 years 6 months ago

Download jmlr.csail.mit.edu

The max-sum classifier predicts an n-tuple of labels from an n-tuple of observable variables by maximizing a sum of quality functions defined over neighbouring pairs of labels and obser...

Vojtech Franc, Bogdan Savchynskyy


CORR
2010
Springer

204 views, Education

Predictive State Temporal Difference Learning

15 years 4 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identification. In practical applications...

Byron Boots, Geoffrey J. Gordon


AIPS
2007

174 views, Artificial Intelligence

Learning to Plan Using Harmonic Analysis of Diffusion Models

15 years 8 months ago

Download www.cs.umass.edu

This paper summarizes research on a new emerging framework for learning to plan using the Markov decision process model (MDP). In this paradigm, two approaches to learning to plan...

Sridhar Mahadevan, Sarah Osentoski, Jeffrey Johns,...
