Search Sciweavers | Sciweavers

18 search results - page 2 / 4

» Incremental Least Squares Policy Iteration for POMDPs

ICML
2010
IEEE

202 views, Machine Learning

Least-Squares Policy Iteration: Bias-Variance Trade-off in Control Problems

13 years 6 months ago

Download www.icml2010.org

Christophe Thiery, Bruno Scherrer

GRC
2008
IEEE

190 views, Applied Computing

Adaptive and Iterative Least Squares Support Vector Regression based on Quadratic Renyi Entropy

13 years 6 months ago

Download www.nlpr.ia.ac.cn

An adaptive and iterative LSSVR algorithm based on quadratic Renyi entropy is presented in this paper. LS-SVM loses the sparseness of support vector which is one of the important ...

Jingqing Jiang, Chuyi Song, Haiyan Zhao, Chunguo W...

ML
2002
ACM

154 views, Machine Learning

Technical Update: Least-Squares Temporal Difference Learning

13 years 4 months ago

Download www.research.rutgers.edu

TD(λ) is a popular family of algorithms for approximate policy evaluation in large MDPs. TD(λ) works by incrementally updating the value function after each observed transition. It h...

Justin A. Boyan

AAAI
2007

126 views, Intelligent Agents

Point-Based Policy Iteration

13 years 7 months ago

Download www.cs.duke.edu

We describe a point-based policy iteration (PBPI) algorithm for infinite-horizon POMDPs. PBPI replaces the exact policy improvement step of Hansen’s policy iteration with point...

Shihao Ji, Ronald Parr, Hui Li, Xuejun Liao, Lawre...

ICML
2010
IEEE

219 views, Machine Learning

Convergence of Least Squares Temporal Difference Methods Under General Conditions

13 years 6 months ago

Download www.cs.helsinki.fi

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...

Huizhen Yu
