Search Sciweavers | Sciweavers

CDC
2009
IEEE

118 views, Control Systems

Opportunistic scheduling in cellular systems in the presence of non-cooperative mobiles

15 years 4 months ago

A central scheduling problem in wireless communications is that of allocating resources to one of many mobile stations that have a common radio channel. Much attention ...

Kavitha Veeraruna, Eitan Altman, Rachid El Azouzi,...

IJCAI
2007

140 views, Artificial Intelligence

Utile Distinctions for Relational Reinforcement Learning

15 years 1 months ago

Download www.ijcai.org

We introduce an approach to autonomously creating state space abstractions for an online reinforcement learning agent using a relational representation. Our approach uses a tree-b...

William Dabney, Amy McGovern

AAAI
2000

104 views, Intelligent Agents

Deliberation in Equilibrium: Bargaining in Computationally Complex Problems

15 years 1 months ago

Download www.aaai.org

We develop a normative theory of interaction (negotiation in particular) among self-interested computationally limited agents where computational actions are game-theoretically tre...

Kate Larson, Tuomas Sandholm

COLT
2010
Springer

191 views, Machine Learning

Best Arm Identification in Multi-Armed Bandits

14 years 9 months ago

Download www.di.ens.fr

We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...

Jean-Yves Audibert, Sébastien Bubeck, Ré...

AIPS
2010

174 views, Artificial Intelligence

When Policies Can Be Trusted: Analyzing a Criteria to Identify Optimal Policies in MDPs with Unknown Model Parameters

15 years 2 months ago

Download www.cs.berkeley.edu

Computing a good policy in stochastic uncertain environments with unknown dynamics and reward model parameters is a challenging task. In a number of domains, ranging from space ro...

Emma Brunskill
