Search Sciweavers | Sciweavers

97 search results - page 3 / 20

» Logarithmic Regret Algorithms for Online Convex Optimization

ICML
2009
IEEE

Efficient learning algorithms for changing environments

14 years 6 months ago

Download www.cs.princeton.edu

We study online learning in an oblivious changing environment. The standard measure of regret bounds the difference between the cost of the online learner and the best decision in...

Elad Hazan, C. Seshadhri


CORR
2004
Springer

Online convex optimization in the bandit setting: gradient descent without a gradient

13 years 5 months ago

Download www.cs.cmu.edu

We study a general online convex optimization problem. We have a convex set S and an unknown sequence of cost functions c_1, c_2, ..., and in each period, we choose a feasible po...

Abraham Flaxman, Adam Tauman Kalai, H. Brendan McM...


ICML
2006
IEEE

Algorithms for portfolio management based on the Newton method

14 years 6 months ago

Download www.cs.princeton.edu

We experimentally study on-line investment algorithms first proposed by Agarwal and Hazan and extended by Hazan et al. which achieve almost the same wealth as the best constant-re...

Amit Agarwal, Elad Hazan, Satyen Kale, Robert E. S...


COLT
2010
Springer

Convex Games in Banach Spaces

13 years 3 months ago

Download www.cs.utexas.edu

We study the regret of an online learner playing a multi-round game in a Banach space B against an adversary that plays a convex function at each round. We characterize the minima...

Karthik Sridharan, Ambuj Tewari


NIPS
2007

Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs

13 years 6 months ago

Download books.nips.cc

We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...

Ambuj Tewari, Peter L. Bartlett
