Search Sciweavers | Sciweavers

590 search results - page 68 / 118

» Can We Learn to Beat the Best Stock

120

JMLR
2010

125 views

Regret Bounds for Gaussian Process Bandit Problems

14 years 8 months ago

Download jmlr.csail.mit.edu

Bandit algorithms are concerned with trading exploration with exploitation where a number of options are available but we can only learn their quality by experimenting with them. ...

Steffen Grünewälder, Jean-Yves Audibert,...

121

ICML
1999
IEEE

152 views, Machine Learning

Distributed Value Functions

16 years 2 months ago

Download www.ri.cmu.edu

Many interesting problems, such as power grids, network switches, and traffic flow, that are candidates for solving with reinforcement learning (RL), also have properties that make distri...

Jeff G. Schneider, Weng-Keen Wong, Andrew W. Moore...

PAKDD
2010
ACM

117 views, Data Mining

BASSET: Scalable Gateway Finder in Large Graphs

15 years 6 months ago

Download eliassi.org

Given a social network, who is the best person to introduce you to, say, Chris Ferguson, the poker champion? Or, given a network of people and skills, who is the best person to he...

Hanghang Tong, Spiros Papadimitriou, Christos Falo...

114

MLMI
2007
Springer

99 views, Machine Learning

Meeting State Recognition from Visual and Aural Labels

15 years 7 months ago

Download groups.inf.ed.ac.uk

In this paper we present a meeting state recognizer based on a combination of multi-modal sensor data in a smart room. Our approach is based on the training of a statistical model ...

Jan Curín, Pascal Fleury, Jan Kleindienst, ...

136

UAI
2008

242 views, Artificial Intelligence

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

15 years 2 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...
