Sciweavers

CORR
2008
Springer
Linearly Parameterized Bandits
We consider bandit problems involving a large (possibly infinite) collection of arms, in which the expected reward of each arm is a linear function of an r-dimensional random vect...
Paat Rusmevichientong, John N. Tsitsiklis
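
The linear reward structure named in the abstract can be illustrated with a minimal sketch. The abstract is truncated, so everything beyond "expected reward is a linear function of an r-dimensional vector" is an assumption made for illustration: the arm vectors, the fixed value of `z`, and the helper names `expected_rewards` and `best_arm` are hypothetical and are not taken from the paper.

```python
# Illustrative sketch only: each arm is a vector u in R^r, and its expected
# reward is assumed to be the inner product u . z for a parameter vector z.
# In the bandit setting z is unknown to the player; it is fixed here purely
# to show the linear structure.

def expected_rewards(arms, z):
    """Return the inner product u . z for every arm u."""
    return [sum(u_i * z_i for u_i, z_i in zip(u, z)) for u in arms]

def best_arm(arms, z):
    """Return the index of the arm with the largest expected reward."""
    rewards = expected_rewards(arms, z)
    return max(range(len(arms)), key=lambda i: rewards[i])

# Hypothetical example with r = 2 and three arms.
arms = [(1.0, 0.0), (0.0, 1.0), (0.6, 0.8)]
z = (0.5, 2.0)

print(expected_rewards(arms, z))
print(best_arm(arms, z))
```

Because rewards depend on the arms only through the shared vector `z`, information gathered by playing one arm also informs the estimates for all the others, which is what makes a large or infinite arm set tractable in this kind of model.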