Search Sciweavers | Sciweavers

271 search results - page 1 / 55

» Bayesian Reward Filtering

189

click to vote

EWRL
2008

191views Machine Learning» more EWRL 2008»

Bayesian Reward Filtering

15 years 6 months ago

Download www.metz.supelec.fr

A wide variety of function approximation schemes have been applied to reinforcement learning. However, Bayesian filtering approaches, which have been shown efficient in other field...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

claim paper

Read More »

152

click to vote

IJCAI
2007

254views Artificial Intelligence» more IJCAI 2007»

Bayesian Inverse Reinforcement Learning

15 years 5 months ago

Download www.ijcai.org

Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an e...

Deepak Ramachandran, Eyal Amir

claim paper

Read More »

146

click to vote

ICML
2005
IEEE

196views Machine Learning» more ICML 2005»

Bayesian sparse sampling for on-line reward optimization

16 years 5 months ago

Download www.cs.ualberta.ca

We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

126

click to vote

CORR
2012
Springer

192views Education» more CORR 2012»

The best of both worlds: stochastic and adversarial bandits

14 years 2 days ago

Download www.princeton.edu

We present a bandit algorithm, SAO (Stochastic and Adversarial Optimal), whose regret is, essentially, optimal both for adversarial rewards and for stochastic rewards. Speciﬁcal...

Sébastien Bubeck, Aleksandrs Slivkins

claim paper

Read More »

133

click to vote

WEBI
2009
Springer

152views Internet Technology» more WEBI 2009»

Zero-Sum Reward and Punishment Collaborative Filtering Recommendation Algorithm

15 years 11 months ago

Download dm.thss.tsinghua.edu.cn

In this paper, we propose a novel memory-based collaborative ﬁltering recommendation algorithm. Our algorithm use a new metric named inﬂuence weight, which is adjusted with ze...

Nan Li, Chunping Li

claim paper

Read More »

« Prev « First page 1 / 55 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers