Sciweavers

ACL
2010
Learning to Adapt to Unknown Users: Referring Expression Generation in Spoken Dialogue Systems
We present a data-driven approach to learn user-adaptive referring expression generation (REG) policies for spoken dialogue systems. Referring expressions can be difficult to unde...
Srinivasan Janarthanam, Oliver Lemon
ATAL
2008
Springer
Simultaneously modeling humans' preferences and their beliefs about others' preferences
In strategic multiagent decision making, it is often the case that a strategic reasoner must hold beliefs about other agents and use these beliefs to inform its decision making. T...
Sevan G. Ficici, Avi Pfeffer
ATAL
2004
Springer
Learning User Preferences for Wireless Services Provisioning
The problem of interest is how to dynamically allocate wireless access services in a competitive market which implements a take-it-or-leave-it allocation mechanism. In this paper ...
George Lee, Steven Bauer, Peyman Faratin, John Wro...
NIPS
1993
Temporal Difference Learning of Position Evaluation in the Game of Go
The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation e...
Nicol N. Schraudolph, Peter Dayan, Terrence J. Sej...
ICML
2009
IEEE
Near-Bayesian exploration in polynomial time
We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...
J. Zico Kolter, Andrew Y. Ng