Search Sciweavers | Sciweavers

23

JMLR
2012

200views Programming Languages» more JMLR 2012»

Contextual Bandit Learning with Predictable Rewards

11 years 7 months ago

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...

claim paper

Read More »

10

click to vote

COCOON
2006
Springer

121views Combinatorics» more COCOON 2006»

Approximating Min-Max (Regret) Versions of Some Polynomial Problems

13 years 8 months ago

Download www.lamsade.dauphine.fr

Abstract. While the complexity of min-max and min-max regret versions of most classical combinatorial optimization problems has been thoroughly investigated, there are very few stu...

Hassene Aissi, Cristina Bazgan, Daniel Vanderpoote...

claim paper

Read More »

11

click to vote

ALT
2007
Springer

134views Machine Learning» more ALT 2007»

Tuning Bandit Algorithms in Stochastic Environments

14 years 1 months ago

Download www.sztaki.hu

Algorithms based on upper-conﬁdence bounds for balancing exploration and exploitation are gaining popularity since they are easy to implement, eﬃcient and eﬀective. In this p...

Jean-Yves Audibert, Rémi Munos, Csaba Szepe...

claim paper

Read More »

14

click to vote

JMLR
2010

125views more JMLR 2010»

Regret Bounds for Gaussian Process Bandit Problems

12 years 11 months ago

Download jmlr.csail.mit.edu

Bandit algorithms are concerned with trading exploration with exploitation where a number of options are available but we can only learn their quality by experimenting with them. ...

Steffen Grünewälder, Jean-Yves Audibert,...

claim paper

Read More »

13

click to vote

ECCC
2010

80views more ECCC 2010»

Regret Minimization for Online Buffering Problems Using the Weighted Majority Algorithm

13 years 4 months ago

Download www.colt2010.org

Suppose a decision maker has to purchase a commodity over time with varying prices and demands. In particular, the price per unit might depend on the amount purchased and this pri...

Melanie Winkler, Berthold Vöcking, Sascha Geu...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers