Sciweavers

22 search results - page 3 / 5
» The Usefulness of Hindsight
Sort
View
AIPS
2009
13 years 6 months ago
Lower Bounding Klondike Solitaire with Monte-Carlo Planning
Despite its ubiquitous presence, very little is known about the odds of winning the simple card game of Klondike Solitaire. The main goal of this paper is to investigate the use o...
Ronald Bjarnason, Alan Fern, Prasad Tadepalli
CIDU
2010
13 years 3 months ago
Tracking Climate Models
Abstract. Climate models are complex mathematical models designed by meteorologists, geophysicists, and climate scientists to simulate and predict climate. Given temperature predic...
Claire Monteleoni, Gavin Schmidt, Shailesh Saroha
ICML
2006
IEEE
14 years 6 months ago
Algorithms for portfolio management based on the Newton method
We experimentally study on-line investment algorithms first proposed by Agarwal and Hazan and extended by Hazan et al. which achieve almost the same wealth as the best constant-re...
Amit Agarwal, Elad Hazan, Satyen Kale, Robert E. S...
ICML
2009
IEEE
14 years 6 months ago
Efficient learning algorithms for changing environments
We study online learning in an oblivious changing environment. The standard measure of regret bounds the difference between the cost of the online learner and the best decision in...
Elad Hazan, C. Seshadhri
CVPR
2010
IEEE
14 years 1 months ago
Online Multiple Instance Learning with No Regret
Multiple instance (MI) learning is a recent learning paradigm that is more flexible than standard supervised learning algorithms in the handling of label ambiguity. It has been u...
Li Mu, James Kwok, Lu Bao-liang