Sciweavers

2 search results - page 1 / 1

» Provably Near-Optimal Sampling-Based Policies for Stochastic...

144

MOR
2007

125views more MOR 2007»

Provably Near-Optimal Sampling-Based Policies for Stochastic Inventory Control Models

15 years 6 months ago

Provably Near-Optimal Sampling-Based Policies for Stochastic Inventory Control Models

Download legacy.orie.cornell.edu

Retsef Levi, Robin Roundy, David B. Shmoys

claim paper

Read More »

217

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

15 years 8 months ago

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

« Prev « First page 1 / 1 Last » Next »