Sciweavers

135 search results - page 20 / 27
» Using Reinforcement Learning to Coordinate Better
Sort
View
ACL
2010
14 years 7 months ago
Learning to Adapt to Unknown Users: Referring Expression Generation in Spoken Dialogue Systems
We present a data-driven approach to learn user-adaptive referring expression generation (REG) policies for spoken dialogue systems. Referring expressions can be difficult to unde...
Srinivasan Janarthanam, Oliver Lemon
GECCO
2009
Springer
200views Optimization» more  GECCO 2009»
15 years 4 months ago
Apply ant colony optimization to Tetris
Tetris is a falling block game where the player’s objective is to arrange a sequence of different shaped tetrominoes smoothly in order to survive. In the intelligence games, ag...
Xingguo Chen, Hao Wang, Weiwei Wang, Yinghuan Shi,...
82
Voted
ACL
2010
14 years 7 months ago
Importance-Driven Turn-Bidding for Spoken Dialogue Systems
Current turn-taking approaches for spoken dialogue systems rely on the speaker releasing the turn before the other can take it. This reliance results in restricted interactions th...
Ethan Selfridge, Peter A. Heeman
ROBOCUP
2000
Springer
130views Robotics» more  ROBOCUP 2000»
15 years 1 months ago
Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition
Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...
Yasutake Takahashi, Masanori Takeda, Minoru Asada
ICML
2009
IEEE
15 years 10 months ago
Monte-Carlo simulation balancing
In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...
David Silver, Gerald Tesauro