Today, the most important issue with the Internet architecture is its ossification, that is, the difficulty of introducing changes to the architecture as a way...
We have modified the popular Mbone tool Vic (VIdeo Conferencing) to use Arequipa (Application REQuested IP over ATM). The latter enables applications, and in particular Vic, to reque...
Werner Almesberger, Leena Chandran-Wadia, Silvia G...
When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...
The current framework of reinforcement learning is based on maximizing the expected return computed from scalar rewards. But in many real-world situations, tradeoffs must be made amon...
In this paper we address the reliability of policies derived by Reinforcement Learning from a limited number of observations. This can be done in a principled manner by taking in...