Sciweavers

101 search results - page 19 / 21
» Control Strategies for a Stochastic Planner
ATAL
2010
Springer
14 years 10 months ago
Incremental plan aggregation for generating policies in MDPs
Despite the recent advances in planning with MDPs, the problem of generating good policies is still hard. This paper describes a way to generate policies in MDPs by (1) determiniz...
Florent Teichteil-Königsbuch, Ugur Kuter, Gui...
SIAMCOMP
2002
14 years 9 months ago
The Nonstochastic Multiarmed Bandit Problem
In the multiarmed bandit problem, a gambler must decide which arm of K nonidentical slot machines to play in a sequence of trials so as to maximize his reward. This class...
Peter Auer, Nicolò Cesa-Bianchi, Yoav Freun...
SIAMCO
2010
14 years 8 months ago
Large-Population LQG Games Involving a Major Player: The Nash Certainty Equivalence Principle
We consider linear-quadratic-Gaussian (LQG) games with a major player and a large number of minor players. The major player has a significant influence on others. The minor playe...
Minyi Huang
GECCO
2007
Springer
Optimization
15 years 3 months ago
An application of EDA and GA to dynamic pricing
E-commerce has transformed the way firms develop their pricing strategies, producing a shift away from fixed pricing to dynamic pricing. In this paper, we use two different Estim...
Siddhartha Shakya, Fernando Oliveira, Gilbert Owus...
AAAI
2000
14 years 11 months ago
Deliberation in Equilibrium: Bargaining in Computationally Complex Problems
We develop a normative theory of interaction, negotiation in particular, among self-interested computationally limited agents where computational actions are game-theoretically tre...
Kate Larson, Tuomas Sandholm