We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...
In a component-oriented approach, components are designed, developed and validated in order to be widely used. However, one cannot always foresee which specific uses will be made o...