Sciweavers

1233 search results - page 195 / 247
» Reinforcement learning
JCP
2007
15 years 3 months ago
Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization
We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-best-paths algorithm. We consider an a...
Nicolas Chapados, Yoshua Bengio
NN
2002
Springer
15 years 3 months ago
Opponent interactions between serotonin and dopamine
Anatomical and pharmacological evidence suggests that the dorsal raphe serotonin system and the ventral tegmental and substantia nigra dopamine system may act as mutual opponents....
Nathaniel D. Daw, Sham Kakade, Peter Dayan
ACL
2010
15 years 2 months ago
Importance-Driven Turn-Bidding for Spoken Dialogue Systems
Current turn-taking approaches for spoken dialogue systems rely on the speaker releasing the turn before the other can take it. This reliance results in restricted interactions th...
Ethan Selfridge, Peter A. Heeman
INLG
2010
Springer
15 years 1 month ago
Feature Selection for Fluency Ranking
16:30 Generating and Validating Abstracts of Meeting Conversations: a User Study. Gabriel Murray, Giuseppe Carenini and Raymond Ng 16:30 - 16:45 Break Session 3: Sentence Level Gen...
Daniël de Kok
SIGDIAL
2010
15 years 1 month ago
Sparse Approximate Dynamic Programming for Dialog Management
Spoken dialogue management strategy optimization by means of Reinforcement Learning (RL) is now part of the state of the art. Yet, there is still a clear mismatch between the comp...
Senthilkumar Chandramohan, Matthieu Geist, Olivier...