Sciweavers

7 search results - page 2 / 2
» TiMDPpoly: An Improved Method for Solving Time-Dependent MDP...
Sort
View
CORR
2010
Springer
105views Education» more  CORR 2010»
13 years 3 months ago
Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence
We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...
Sarah Filippi, Olivier Cappé, Aurelien Gari...
SIAMNUM
2010
115views more  SIAMNUM 2010»
12 years 11 months ago
Superconvergence of Discontinuous Galerkin and Local Discontinuous Galerkin Schemes for Linear Hyperbolic and Convection-Diffusi
In this paper, we study the superconvergence property for the discontinuous Galerkin (DG) and the local discontinuous Galerkin (LDG) methods, for solving one-dimensional time depe...
Yingda Cheng, Chi-Wang Shu