Search Sciweavers | Sciweavers

27 search results - page 1 / 6

» Improved Rates for the Stochastic Continuum-Armed Bandit Pro...

click to vote

COLT
2007
Springer

144views Machine Learning» more COLT 2007»

Improved Rates for the Stochastic Continuum-Armed Bandit Problem

14 years 4 days ago

Download www.sztaki.hu

Abstract. Considering one-dimensional continuum-armed bandit problems, we propose an improvement of an algorithm of Kleinberg and a new set of conditions which give rise to improve...

Peter Auer, Ronald Ortner, Csaba Szepesvári

claim paper

Read More »

click to vote

NIPS
2004

136views Information Technology» more NIPS 2004»

Nearly Tight Bounds for the Continuum-Armed Bandit Problem

13 years 7 months ago

Download books.nips.cc

In the multi-armed bandit problem, an online algorithm must choose from a set of strategies in a sequence of n trials so as to minimize the total cost of the chosen strategies. Wh...

Robert D. Kleinberg

claim paper

Read More »

click to vote

CORR
2011
Springer

202views Education» more CORR 2011»

Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems

13 years 29 days ago

Download www.ualberta.ca

The analysis of online least squares estimation is at the heart of many stochastic sequential decision-making problems. We employ tools from the self-normalized processes to provi...

Yasin Abbasi-Yadkori, Dávid Pál, Csa...

claim paper

Read More »

click to vote

JMLR
2012

165views Programming Languages» more JMLR 2012»

PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits

11 years 8 months ago

Download homes.di.unimi.it

We develop a new tool for data-dependent analysis of the exploration-exploitation trade-oﬀ in learning under limited feedback. Our tool is based on two main ingredients. The ﬁ...

Yevgeny Seldin, Nicolò Cesa-Bianchi, Peter ...

claim paper

Read More »

click to vote

SIAMCOMP
2002

124views more SIAMCOMP 2002»

The Nonstochastic Multiarmed Bandit Problem

13 years 5 months ago

Download homes.dsi.unimi.it

Abstract. In the multiarmed bandit problem, a gambler must decide which arm of K nonidentical slot machines to play in a sequence of trials so as to maximize his reward. This class...

Peter Auer, Nicolò Cesa-Bianchi, Yoav Freun...

claim paper

Read More »

« Prev « First page 1 / 6 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers