Search Sciweavers | Sciweavers

263 search results - page 4 / 53

» Regret Bounds for Prediction Problems

click to vote

ALT
2010
Springer

147views Machine Learning» more ALT 2010»

Optimal Online Prediction in Adversarial Environments

13 years 7 months ago

Download www.math.pku.edu.cn

: In many prediction problems, including those that arise in computer security and computational finance, the process generating the data is best modeled as an adversary with whom ...

Peter L. Bartlett

claim paper

Read More »

click to vote

DISOPT
2010

132views more DISOPT 2010»

General approximation schemes for min-max (regret) versions of some (pseudo-)polynomial problems

13 years 5 months ago

Download www.lamsade.dauphine.fr

While the complexity of min-max and min-max regret versions of most classical combinatorial optimization problems has been thoroughly investigated, there are very few studies abou...

Hassene Aissi, Cristina Bazgan, Daniel Vanderpoote...

claim paper

Read More »

click to vote

ORL
2008

99views more ORL 2008»

Some tractable instances of interval data minmax regret problems

13 years 5 months ago

Download www.lamsade.dauphine.fr

This paper focuses on tractable instances of interval data minmax regret graph problems. More precisely, we provide polynomial and pseudopolynomial algorithms for sets of particul...

Bruno Escoffier, Jérôme Monnot, Olivi...

claim paper

Read More »

click to vote

JMLR
2010

103views more JMLR 2010»

Regret Bounds and Minimax Policies under Partial Monitoring

13 years 16 days ago

Download jmlr.csail.mit.edu

This work deals with four classical prediction settings, namely full information, bandit, label efficient and bandit label efficient as well as four different notions of regret: p...

Jean-Yves Audibert, Sébastien Bubeck

claim paper

Read More »

click to vote

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

13 years 3 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

« Prev « First page 4 / 53 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers