Sciweavers

1306 search results - page 227 / 262
» Strategy Acquisition for the Game
AAAI
2006
15 years 2 months ago
Point-based Dynamic Programming for DEC-POMDPs
We introduce point-based dynamic programming (DP) for decentralized partially observable Markov decision processes (DEC-POMDPs), a new discrete DP algorithm for planning strategie...
Daniel Szer, François Charpillet
NIPS
2001
15 years 2 months ago
Intransitive Likelihood-Ratio Classifiers
In this work, we introduce an information-theoretic based correction term to the likelihood ratio classification method for multiple classes. Under certain conditions, the term is ...
Jeff Bilmes, Gang Ji, Marina Meila
AAAI
2010
15 years 1 months ago
Bayesian Policy Search for Multi-Agent Role Discovery
Bayesian inference is an appealing approach for leveraging prior knowledge in reinforcement learning (RL). In this paper we describe an algorithm for discovering different classes...
Aaron Wilson, Alan Fern, Prasad Tadepalli
ECAI
2010
Springer
15 years 1 months ago
Nested Monte-Carlo Expression Discovery
Nested Monte-Carlo search is a general algorithm that gives good results in single player games. Genetic Programming evaluates and combines trees to discover expressions that maxim...
Tristan Cazenave
NETWORKS
2008
15 years 22 days ago
Price competition with elastic traffic
In this paper, we present a combined study of price competition and traffic control in a congested network. We study a model in which service providers own the routes in a network...
Asuman E. Ozdaglar