Search Sciweavers | Sciweavers

121 search results - page 14 / 25

» Learning Decision Theoretic Utilities through Reinforcement ...

217

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 8 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

263

click to vote

SIGDIAL
2010

137views Natural Language Processing» more SIGDIAL 2010»

Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy

15 years 5 months ago

Download mastarpj.nict.go.jp

This paper presents a spoken dialogue framework that helps users in making decisions. Users often do not have a definite goal or criteria for selecting from a list of alternatives...

Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chi...

claim paper

Read More »

188

click to vote

GAMEON
2007

139views Modeling And Simulation» more GAMEON 2007»

Agent Based Virtual Tutorship and E-Learning Techniques Applied to a Business Game Built on System Dynamics

15 years 9 months ago

Download www.di.unito.it

An advanced Business Game is presented in the paper, built on the methodology of System Dynamics. It can be used for cognitive learning and knowledge transmission in schools and U...

Marco Remondino

claim paper

Read More »

198

Voted

ISPE
2003

134views Distributed And Parallel Com...» more ISPE 2003»

Coordination in utility managed multi-agent groups

15 years 9 months ago

Download asc.di.fct.unl.pt

A two stage approach to co-ordination in a multi-agent society is presented. The ﬁrst stage involves agents learning to co-ordinate their activities based on local and global uti...

Fernanda Barbosa, José C. Cunha, Omer F. Ra...

claim paper

Read More »

241

click to vote

LION
2007
Springer

192views Optimization» more LION 2007»

Learning While Optimizing an Unknown Fitness Surface

16 years 1 months ago

Download www.science.unitn.it

This paper is about Reinforcement Learning (RL) applied to online parameter tuning in Stochastic Local Search (SLS) methods. In particular a novel application of RL is considered i...

Roberto Battiti, Mauro Brunato, Paolo Campigotto

claim paper

Read More »

« Prev « First page 14 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers