Search Sciweavers | Sciweavers

2 search results - page 1 / 1

ICML 2009, IEEE

Piecewise-stationary bandit problems with side observations

14 years 10 months ago

Download www.cim.mcgill.ca

We consider a sequential decision problem where the rewards are generated by a piecewise-stationary distribution. However, the different reward distributions are unknown and may c...

Jia Yuan Yu, Shie Mannor

ICML 2008, IEEE

Exploration scavenging

14 years 10 months ago

Download hunch.net

We examine the problem of evaluating a policy in the contextual bandit setting using only observations collected during the execution of another policy. We show that policy evalua...

John Langford, Alexander L. Strehl, Jennifer Wortm...
