Search Sciweavers | Sciweavers

7 search results - page 2 / 2

» Online exploration in least-squares policy iteration

MOBIHOC
2007
ACM

Computer Networks

Distributed opportunistic scheduling for ad-hoc communications: an optimal stopping approach

We consider distributed opportunistic scheduling (DOS) in wireless ad-hoc networks, where many links contend for the same channel using random access. In such networks, distribute...

Dong Zheng, Weiyan Ge, Junshan Zhang

COLT
2008
Springer

Machine Learning

Adapting to a Changing Environment: the Brownian Restless Bandits

In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...

Aleksandrs Slivkins, Eli Upfal
