Sciweavers

121 search results - page 14 / 25
» Learning Decision Theoretic Utilities through Reinforcement ...
Sort
View
ICML
2001
IEEE
15 years 10 months ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
121
Voted
SIGDIAL
2010
14 years 7 months ago
Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy
This paper presents a spoken dialogue framework that helps users in making decisions. Users often do not have a definite goal or criteria for selecting from a list of alternatives...
Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chi...
GAMEON
2007
14 years 11 months ago
Agent Based Virtual Tutorship and E-Learning Techniques Applied to a Business Game Built on System Dynamics
An advanced Business Game is presented in the paper, built on the methodology of System Dynamics. It can be used for cognitive learning and knowledge transmission in schools and U...
Marco Remondino
ISPE
2003
14 years 11 months ago
Coordination in utility managed multi-agent groups
A two stage approach to co-ordination in a multi-agent society is presented. The first stage involves agents learning to co-ordinate their activities based on local and global uti...
Fernanda Barbosa, José C. Cunha, Omer F. Ra...
LION
2007
Springer
192views Optimization» more  LION 2007»
15 years 3 months ago
Learning While Optimizing an Unknown Fitness Surface
This paper is about Reinforcement Learning (RL) applied to online parameter tuning in Stochastic Local Search (SLS) methods. In particular a novel application of RL is considered i...
Roberto Battiti, Mauro Brunato, Paolo Campigotto