We introduce point-based dynamic programming (DP) for decentralized partially observable Markov decision processes (DEC-POMDPs), a new discrete DP algorithm for planning strategie...
In this work, we introduce an information-theoreticbased correction term to the likelihood ratio classification method for multiple classes. Under certain conditions, the term is ...
Bayesian inference is an appealing approach for leveraging prior knowledge in reinforcement learning (RL). In this paper we describe an algorithm for discovering different classes...
Nested Monte-Carlo search is a general algorithm that gives good results in single player games. Genetic Programming evaluates and combines trees to discover expressions that maxim...
In this paper, we present a combined study of price competition and traffic control in a congested network. We study a model in which service providers own the routes in a network...