Search Sciweavers | Sciweavers

60 search results - page 2 / 12

» Revisiting Natural Actor-Critics with Value Function Approxi...

click to vote

DFG
2007
Springer

148views Operating System» more DFG 2007»

Natural Neighbor Concepts in Scattered Data Interpolation and Discrete Function Approximation

13 years 11 months ago

Download www-umlauf.informatik.uni-kl.de

: The concept of natural neighbors employs the notion of distance to deﬁne local neighborhoods in discrete data. Especially when querying and accessing large scale data, it is im...

Tom Bobach, Georg Umlauf

claim paper

Read More »

click to vote

TSMC
2008

132views more TSMC 2008»

Ensemble Algorithms in Reinforcement Learning

13 years 4 months ago

Download people.cs.uu.nl

This paper describes several ensemble methods that combine multiple different reinforcement learning (RL) algorithms in a single agent. The aim is to enhance learning speed and fin...

Marco A. Wiering, Hado van Hasselt

claim paper

Read More »

click to vote

ICML
2007
IEEE

204views Machine Learning» more ICML 2007»

Constructing basis functions from directed graphs for value function approximation

14 years 5 months ago

Download www.machinelearning.org

Basis functions derived from an undirected graph connecting nearby samples from a Markov decision process (MDP) have proven useful for approximating value functions. The success o...

Jeffrey Johns, Sridhar Mahadevan

claim paper

Read More »

click to vote

ECCV
2006
Springer

123views Computer Vision» more ECCV 2006»

Trace Quotient Problems Revisited

13 years 8 months ago

Download mmlab.ie.cuhk.edu.hk

The formulation of trace quotient is shared by many computer vision problems; however, it was conventionally approximated by an essentially different formulation of quotient trace,...

Shuicheng Yan, Xiaoou Tang

claim paper

Read More »

click to vote

ECML
2004
Springer

154views Machine Learning» more ECML 2004»

Experiments in Value Function Approximation with Sparse Support Vector Regression

13 years 10 months ago

Download userweb.cs.utexas.edu

Abstract. We present ﬁrst experiments using Support Vector Regression as function approximator for an on-line, sarsa-like reinforcement learner. To overcome the batch nature of S...

Tobias Jung, Thomas Uthmann

claim paper

Read More »

« Prev « First page 2 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers