Abstract. Most of multi-agent reinforcement learning algorithms aim to converge to a Nash equilibrium, but a Nash equilibrium does not necessarily mean a desirable result. On the o...
Although many powerful AI and machine learning techniques exist, it remains difficult to quickly create AI for embodied virtual agents that produces visually lifelike behavior. T...
Jonathan Dinerstein, Parris K. Egbert, Dan Ventura
Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...
While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...
Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...
Abstract. Knowledge work is performed in all occupations and across all industries. The level of similarity of knowledge work allows for designing supporting tools that can be wide...
Andreas Kaschig, Ronald Maier, Alexander Sandow, M...