Sciweavers

2 search results - page 1 / 1
» Provably Near-Optimal Sampling-Based Policies for Stochastic...
Sort
View
67
Voted
MOR
2007
125views more  MOR 2007»
14 years 10 months ago
Provably Near-Optimal Sampling-Based Policies for Stochastic Inventory Control Models
Retsef Levi, Robin Roundy, David B. Shmoys
IJCAI
2001
14 years 12 months ago
R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning
R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...
Ronen I. Brafman, Moshe Tennenholtz