Sciweavers

CDC
2008
IEEE

104views Control Systems» more CDC 2008»

A structured multiarmed bandit problem and the greedy policy

13 years 11 months ago

—We consider a multiarmed bandit problem where the expected reward of each arm is a linear function of an unknown scalar with a prior distribution. The objective is to choose a s...

Adam J. Mersereau, Paat Rusmevichientong, John N. ...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers