Search Sciweavers | Sciweavers

44 search results - page 9 / 9

» Simple Stochastic Parity Games

click to vote

SIAMCOMP
2002

124views more SIAMCOMP 2002»

The Nonstochastic Multiarmed Bandit Problem

13 years 4 months ago

Download homes.dsi.unimi.it

Abstract. In the multiarmed bandit problem, a gambler must decide which arm of K nonidentical slot machines to play in a sequence of trials so as to maximize his reward. This class...

Peter Auer, Nicolò Cesa-Bianchi, Yoav Freun...

claim paper

Read More »

click to vote

AIPS
2010

174views Artificial Intelligence» more AIPS 2010»

When Policies Can Be Trusted: Analyzing a Criteria to Identify Optimal Policies in MDPs with Unknown Model Parameters

13 years 7 months ago

Download www.cs.berkeley.edu

Computing a good policy in stochastic uncertain environments with unknown dynamics and reward model parameters is a challenging task. In a number of domains, ranging from space ro...

Emma Brunskill

claim paper

Read More »

click to vote

ATAL
2008
Springer

180views Intelligent Agents» more ATAL 2008»

On the usefulness of opponent modeling: the Kuhn Poker case study

13 years 6 months ago

Download www.ifaamas.org

The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...

Alessandro Lazaric, Mario Quaresimale, Marcello Re...

claim paper

Read More »

click to vote

SIGMETRICS
2008
ACM

161views Hardware» more SIGMETRICS 2008»

Noncooperative power control and transmission scheduling in wireless collision channels

13 years 4 months ago

Download www.mit.edu

We consider a wireless collision channel, shared by a finite number of mobile users who transmit to a common base station using a random access protocol. Mobiles are selfoptimizin...

Ishai Menache, Nahum Shimkin

claim paper

Read More »

« Prev « First page 9 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers