Sciweavers

COLT
2005
Springer

From External to Internal Regret

13 years 6 months ago
From External to Internal Regret
External regret compares the performance of an online algorithm, selecting among N actions, to the performance of the best of those actions in hindsight. Internal regret compares the loss of an online algorithm to the loss of a modified online algorithm, which consistently replaces one action by another. In this paper we give a simple generic reduction that, given an algorithm for the external regret problem, converts it to an efficient online algorithm for the internal regret problem. We provide methods that work both in the full information model, in which the loss of every action is observed at each time step, and the partial information (bandit) model, where at each time step only the loss of the selected action is observed. The importance of internal regret in game theory is due to the fact that in a general game, if each player has sublinear internal regret, then the empirical frequencies converge to a correlated equilibrium. For external regret we also derive a quantitative re...
Avrim Blum, Yishay Mansour
Added 13 Oct 2010
Updated 13 Oct 2010
Type Conference
Year 2005
Where COLT
Authors Avrim Blum, Yishay Mansour
Comments (0)