Sciweavers

CDC
2009
IEEE
169views Control Systems» more  CDC 2009»
13 years 9 months ago
Parametric regret in uncertain Markov decision processes
— We consider decision making in a Markovian setup where the reward parameters are not known in advance. Our performance criterion is the gap between the performance of the best ...
Huan Xu, Shie Mannor