Search Sciweavers | Sciweavers

200 search results - page 38 / 40

» Point-Based Policy Iteration

187

click to vote

MOBIHOC
2007
ACM

150views Computer Networks» more MOBIHOC 2007»

Distributed opportunistic scheduling for ad-hoc communications: an optimal stopping approach

16 years 6 months ago

Download www.public.asu.edu

We consider distributed opportunistic scheduling (DOS) in wireless ad-hoc networks, where many links contend for the same channel using random access. In such networks, distribute...

Dong Zheng, Weiyan Ge, Junshan Zhang

claim paper

Read More »

182

click to vote

COLT
2008
Springer

179views Machine Learning» more COLT 2008»

Adapting to a Changing Environment: the Brownian Restless Bandits

15 years 8 months ago

Download research.microsoft.com

In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...

Aleksandrs Slivkins, Eli Upfal

claim paper

Read More »

194

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 8 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

216

Voted

WWW
2010
ACM

243views Internet Technology» more WWW 2010»

Privacy wizards for social networking sites

16 years 1 months ago

Download www.eecs.umich.edu

Privacy is an enormous problem in online social networking sites. While sites such as Facebook allow users ﬁne-grained control over who can see their proﬁles, it is diﬃcult ...

Lujun Fang, Kristen LeFevre

claim paper

Read More »

149

click to vote

ICRA
2008
IEEE

167views Robotics» more ICRA 2008»

An approximate algorithm for solving oracular POMDPs

16 years 1 months ago

Download www.cs.cmu.edu

Abstract— We propose a new approximate algorithm, LAJIV (Lookahead J-MDP Information Value), to solve Oracular Partially Observable Markov Decision Problems (OPOMDPs), a special ...

Nicholas Armstrong-Crews, Manuela M. Veloso

claim paper

Read More »

« Prev « First page 38 / 40 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers