Search Sciweavers | Sciweavers

127 search results - page 2 / 26

» A linear approximation method for the Shapley value

ICML 2007 (IEEE), Machine Learning

Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation

14 years 6 months ago

Download www.machinelearning.org

Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to co...

Chee Wee Phua, Robert Fitch


ICRA 2007 (IEEE), Robotics

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

13 years 11 months ago

Download sugiyama-www.cs.titech.ac.jp

The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...


MOC 1998

Approximation of continuous time stochastic processes by a local linearization method

13 years 5 months ago

Download www.ams.org

This paper investigates the rate of convergence of an alternative approximation method for stochastic differential equations. The rates of convergence of the one-step and multi-st...

Isao Shoji


ICRA 2009 (IEEE), Robotics

Least absolute policy iteration for robust value function approximation

13 years 12 months ago

Download sugiyama-www.cs.titech.ac.jp

Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...


CDC 2009 (IEEE), Control Systems

Semidefinite programming methods for system realization and identification

13 years 9 months ago

Download www.ee.ucla.edu

We describe semidefinite programming methods for system realization and identification. For each of these two applications, a variant of a simple subspace algorithm is presented, i...

Zhang Liu, Lieven Vandenberghe
