Sciweavers

127 search results - page 2 / 26
» A linear approximation method for the Shapley value
ICML
2007
IEEE
14 years 6 months ago
Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation
Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to co...
Chee Wee Phua, Robert Fitch
ICRA
2007
IEEE
13 years 11 months ago
Value Function Approximation on Non-Linear Manifolds for Robot Motor Control
The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...
Masashi Sugiyama, Hirotaka Hachiya, Christopher To...
MOC
1998
13 years 5 months ago
Approximation of continuous time stochastic processes by a local linearization method
This paper investigates the rate of convergence of an alternative approximation method for stochastic differential equations. The rates of convergence of the one-step and multi-st...
Isao Shoji
ICRA
2009
IEEE
13 years 12 months ago
Least absolute policy iteration for robust value function approximation
Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...
Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...
CDC
2009
IEEE
13 years 9 months ago
Semidefinite programming methods for system realization and identification
We describe semidefinite programming methods for system realization and identification. For each of these two applications, a variant of a simple subspace algorithm is presented, i...
Zhang Liu, Lieven Vandenberghe