Sciweavers

3084 search results - page 136 / 617
» Learning to Take Actions
Sort
View
COLT
2004
Springer
15 years 3 months ago
Deterministic Calibration and Nash Equilibrium
Abstract. We provide a natural learning process in which the joint frequency of empirical play converges into the set of convex combinations of Nash equilibria. In this process, al...
Sham Kakade, Dean P. Foster
ISAMI
2010
14 years 8 months ago
Accurate Temporal Relationships in Sequences of User Behaviours in Intelligent Environments
Intelligent Environments are supposed to act proactively anticipating user's needs and preferences in order to provide effective support. Therefore, learning user's frequ...
Asier Aztiria, Juan Carlos Augusto, Rosa Basagoiti...
COLT
2004
Springer
15 years 3 months ago
Reinforcement Learning for Average Reward Zero-Sum Games
Abstract. We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The first is based on relative Q-learning and the ...
Shie Mannor
ICANN
2001
Springer
15 years 2 months ago
Market-Based Reinforcement Learning in Partially Observable Worlds
Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...
Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber
ACL
2008
14 years 11 months ago
Using Automatically Transcribed Dialogs to Learn User Models in a Spoken Dialog System
We use an EM algorithm to learn user models in a spoken dialog system. Our method requires automatically transcribed (with ASR) dialog corpora, plus a model of transcription error...
Umar Syed, Jason Williams