While exploring to find better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...
Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...
We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...
Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...
In this paper we propose a general framework for local path planning and steering that can be easily extended to perform high-level behaviors. Our framework is based on the concept ...
Mubbasir Kapadia, Shawn Singh, William Hewlett, Pe...
When navigating in an unknown environment for the first time, a natural behavior consists in memorizing some key views along the performed path, in order to use these referenc...
Guillaume Le Blanc, Youcef Mezouar, Philippe Marti...
We present the XDI Model for specifying delay-insensitive circuits, that is, reactive systems that correctly exchange signals with their environment in spite of unknown delays inc...