Sciweavers

138 search results for "An Inverse Method for Policy-Iteration Based Algorithms"
CORR
2009
Springer
13 years 3 months ago
An Inverse Method for Policy-Iteration Based Algorithms
Laurent Fribourg, Étienne André
AAAI
2006
13 years 7 months ago
Incremental Least Squares Policy Iteration for POMDPs
We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...
Hui Li, Xuejun Liao, Lawrence Carin
CDC
2010
IEEE
13 years 18 days ago
Pathologies of temporal difference methods in approximate dynamic programming
Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...
Dimitri P. Bertsekas
SIAMSC
2011
12 years 8 months ago
A Fast Parallel Algorithm for Selected Inversion of Structured Sparse Matrices with Application to 2D Electronic Structure Calcu
Abstract. An efficient parallel algorithm is presented and tested for computing selected components of H^{-1} where H has the structure of a Hamiltonian matrix of two-dimensional la...
Lin Lin, Chao Yang, Jianfeng Lu, Lexing Ying, Wein...
AMDO
2008
Springer
13 years 7 months ago
Inverse Kinematics Using Sequential Monte Carlo Methods
Abstract. In this paper we propose an original approach to solve the Inverse Kinematics problem. Our framework is based on Sequential Monte Carlo Methods and has the advantage to a...
Nicolas Courty, Elise Arnaud