Defensive Universal Learning with Experts

16 years 1 months ago

Download www.idsia.ch

This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedback from the actions actually chosen (bandit setup), (b) it can be applied with countably inﬁnite expert classes, and (c) it copes with losses that may grow in time appropriately slowly. We prove loss bounds against an adaptive adversary. From this, we obtain a master algorithm for “reactive” experts problems, which means that the master’s actions may inﬂuence the behavior of the adversary. Our algorithm can signiﬁcantly outperform standard experts algorithms on such problems. Finally, we combine it with a universal expert class. The resulting universal learner performs – in a certain sense – almost as well as any computable strategy, for any online decision problem. We also specify the (worst-case) convergence speed, which is very slow. Keywords. Prediction with expert advice, responsive envir...

Jan Poland, Marcus Hutter

Real-time Traffic