Sciweavers

83 search results - page 3 / 17
» Optimization of a Billiard Player - Tactical Play
Sort
View
NIPS
2001
13 years 7 months ago
Playing is believing: The role of beliefs in multi-agent learning
We propose a new classification for multi-agent learning algorithms, with each league of players characterized by both their possible strategies and possible beliefs. Using this c...
Yu-Han Chang, Leslie Pack Kaelbling
FSTTCS
2010
Springer
13 years 4 months ago
Playing in stochastic environment: from multi-armed bandits to two-player games
Given a zero-sum infinite game we examine the question if players have optimal memoryless deterministic strategies. It turns out that under some general conditions the problem for...
Wieslaw Zielonka
FORMATS
2005
Springer
13 years 12 months ago
Average Reward Timed Games
We consider real-time games where the goal consists, for each player, in maximizing the average amount of reward he or she receives per time unit. We consider zero-sum rewards, so ...
B. Thomas Adler, Luca de Alfaro, Marco Faella
ICASSP
2011
IEEE
12 years 10 months ago
Logarithmic weak regret of non-Bayesian restless multi-armed bandit
Abstract—We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. At each time, a player chooses K out of N (N > K) arms to play. The state of each ar...
Haoyang Liu, Keqin Liu, Qing Zhao
ML
1998
ACM
148views Machine Learning» more  ML 1998»
13 years 6 months ago
Colearning in Differential Games
Game playing has been a popular problem area for research in artificial intelligence and machine learning for many years. In almost every study of game playing and machine learnin...
John W. Sheppard