Search Sciweavers | Sciweavers

74 search results - page 1 / 15

» Regret Bounds for Gaussian Process Bandit Problems

click to vote

JMLR
2010

125views more JMLR 2010»

Regret Bounds for Gaussian Process Bandit Problems

12 years 11 months ago

Download jmlr.csail.mit.edu

Bandit algorithms are concerned with trading exploration with exploitation where a number of options are available but we can only learn their quality by experimenting with them. ...

Steffen Grünewälder, Jean-Yves Audibert,...

claim paper

Read More »

click to vote

ICML
2010
IEEE

204views Machine Learning» more ICML 2010»

Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design

13 years 5 months ago

Download www.its.caltech.edu

Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We formalize this task as a multiarmed bandit problem, where the payoff function is ...

Niranjan Srinivas, Andreas Krause, Sham Kakade, Ma...

claim paper

Read More »

click to vote

CORR
2010
Springer

174views Education» more CORR 2010»

Gaussian Process Bandits for Tree Search

13 years 4 months ago

Download www.mendeley.com

We motivate and analyse a new Tree Search algorithm, based on recent advances in the use of Gaussian Processes for bandit problems. We assume that the function to maximise on the ...

Louis Dorard, John Shawe-Taylor

claim paper

Read More »

click to vote

CORR
2011
Springer

202views Education» more CORR 2011»

Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems

12 years 11 months ago

Download www.ualberta.ca

The analysis of online least squares estimation is at the heart of many stochastic sequential decision-making problems. We employ tools from the self-normalized processes to provi...

Yasin Abbasi-Yadkori, Dávid Pál, Csa...

claim paper

Read More »

click to vote

ALT
2009
Springer

128views Machine Learning» more ALT 2009»

Pure Exploration in Multi-armed Bandits Problems

14 years 1 months ago

Download sequel.futurs.inria.fr

Abstract. We consider the framework of stochastic multi-armed bandit problems and study the possibilities and limitations of strategies that explore sequentially the arms. The stra...

Sébastien Bubeck, Rémi Munos, Gilles...

claim paper

Read More »

« Prev « First page 1 / 15 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers