Search Sciweavers | Sciweavers

21

CORR
2010
Springer

187views Education» more CORR 2010»

Learning in A Changing World: Non-Bayesian Restless Multi-Armed Bandit

13 years 5 months ago

We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. In this problem, at each time, a player chooses K out of N (N > K) arms to play. The state of ...

Haoyang Liu, Keqin Liu, Qing Zhao

claim paper

Read More »

13

click to vote

ICASSP
2011
IEEE

177views Signal Processing» more ICASSP 2011»

Logarithmic weak regret of non-Bayesian restless multi-armed bandit

12 years 8 months ago

Download www.ece.ucdavis.edu

Abstract—We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. At each time, a player chooses K out of N (N > K) arms to play. The state of each ar...

Haoyang Liu, Keqin Liu, Qing Zhao

claim paper

Read More »

18

click to vote

ICRA
2009
IEEE

132views Robotics» more ICRA 2009»

Smoothed Sarsa: Reinforcement learning for robot delivery tasks

13 years 11 months ago

Download alumni.media.mit.edu

— Our goal in this work is to make high level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to ﬁnd a sequence of actio...

Deepak Ramachandran, Rakesh Gupta

claim paper

Read More »

15

click to vote

CORR
2000
Springer

126views Education» more CORR 2000»

Learning to Filter Spam E-Mail: A Comparison of a Naive Bayesian and a Memory-Based Approach

13 years 4 months ago

Download eric.univ-lyon2.fr

We investigate the performance of two machine learning algorithms in the context of antispam filtering. The increasing volume of unsolicited bulk e-mail (spam) has generated a nee...

Ion Androutsopoulos, Georgios Paliouras, Vangelis ...

claim paper

Read More »

7

click to vote

NIPS
2004

134views Information Technology» more NIPS 2004»

Bayesian Regularization and Nonnegative Deconvolution for Time Delay Estimation

13 years 6 months ago

Download books.nips.cc

Bayesian Regularization and Nonnegative Deconvolution (BRAND) is proposed for estimating time delays of acoustic signals in reverberant environments. Sparsity of the nonnegative f...

Yuanqing Lin, Daniel D. Lee

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers