Search Sciweavers | Sciweavers

COLT
2004
Springer

99 views, Machine Learning

Deterministic Calibration and Nash Equilibrium

15 years 3 months ago

Abstract. We provide a natural learning process in which the joint frequency of empirical play converges into the set of convex combinations of Nash equilibria. In this process, al...

Sham Kakade, Dean P. Foster

ISAMI
2010

124 views, Emerging Technology

Accurate Temporal Relationships in Sequences of User Behaviours in Intelligent Environments

14 years 8 months ago

Download www.infj.ulst.ac.uk

Intelligent Environments are supposed to act proactively, anticipating user's needs and preferences in order to provide effective support. Therefore, learning user's frequ...

Asier Aztiria, Juan Carlos Augusto, Rosa Basagoiti...

COLT
2004
Springer

99 views, Machine Learning

Reinforcement Learning for Average Reward Zero-Sum Games

15 years 3 months ago

Download www.ece.mcgill.ca

Abstract. We consider Reinforcement Learning for average reward zero-sum stochastic games. We present and analyze two algorithms. The first is based on relative Q-learning and the ...

Shie Mannor

ICANN
2001
Springer

123 views, Neural Networks

Market-Based Reinforcement Learning in Partially Observable Worlds

15 years 2 months ago

Download www.hutter1.net

Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...

Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber

ACL
2008

96 views, Computational Linguistics

Using Automatically Transcribed Dialogs to Learn User Models in a Spoken Dialog System

14 years 11 months ago

Download www.aclweb.org

We use an EM algorithm to learn user models in a spoken dialog system. Our method requires automatically transcribed (with ASR) dialog corpora, plus a model of transcription error...

Umar Syed, Jason Williams
