Sciweavers

6 search results - page 2 / 2
» Online Regret Bounds for Markov Decision Processes with Dete...
Sort
View
UAI
2000
13 years 5 months ago
PEGASUS: A policy search method for large MDPs and POMDPs
We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...
Andrew Y. Ng, Michael I. Jordan