Sciweavers

9841 search results - page 30 / 1969
» Distributed Value Functions
ICRA
2009
IEEE
143 views, Robotics
15 years 4 months ago
Least absolute policy iteration for robust value function approximation
Abstract— Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...
Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...
ISAAC
2009
Springer
83 views, Algorithms
15 years 4 months ago
Reconstructing Numbers from Pairwise Function Values
Shiteng Chen, Zhiyi Huang, Sampath Kannan
ASCM
2007
Springer
283 views, Mathematics
15 years 3 months ago
Computing the Minkowski Value of the Exponential Function over a Complex Disk
Hyeong In Choi, Rida T. Farouki, Chang Yong Han, H...