Search Sciweavers | Sciweavers

64 search results - page 8 / 13

» *-Minimax Performance in Backgammon

IJCAI
2007

179 views, Artificial Intelligence

Heuristic Selection of Actions in Multiagent Reinforcement Learning

15 years 1 month ago

Download www.ijcai.org

This work presents a new algorithm, called Heuristically Accelerated Minimax-Q (HAMMQ), that allows the use of heuristics to speed up the well-known Multiagent Reinforcement Learni...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...

COLT
2010
Springer

129 views, Machine Learning

Nonparametric Bandits with Covariates

14 years 9 months ago

Download www.princeton.edu

We consider a bandit problem which involves sequential sampling from two populations (arms). Each arm produces a noisy reward realization which depends on an observable random cov...

Philippe Rigollet, Assaf Zeevi

JSAC
2011

159 views

An Anti-Jamming Stochastic Game for Cognitive Radio Networks

14 years 6 months ago

Download sig.umd.edu

Various spectrum management schemes have been proposed in recent years to improve the spectrum utilization in cognitive radio networks. However, few of them have considered the ...

Beibei Wang, Yongle Wu, K. J. Ray Liu, T. Charles ...

COLT
2006
Springer

132 views, Machine Learning

Online Learning with Variable Stage Duration

15 years 3 months ago

Download www.ece.mcgill.ca

We consider online learning in repeated decision problems, within the framework of a repeated game against an arbitrary opponent. For repeated matrix games, well-known results esta...

Shie Mannor, Nahum Shimkin

AUSAI
2009
Springer

99 views, Artificial Intelligence

MML Invariant Linear Regression

15 years 6 months ago

Download www.csse.monash.edu.au

This paper derives two new information-theoretic linear regression criteria based on the minimum message length principle. Both criteria are invariant to full rank affine...

Daniel F. Schmidt, Enes Makalic
