Search Sciweavers | Sciweavers

135 search results - page 20 / 27

» Using Reinforcement Learning to Coordinate Better

103

click to vote

ACL
2010

176views Computational Linguistics» more ACL 2010»

Learning to Adapt to Unknown Users: Referring Expression Generation in Spoken Dialogue Systems

14 years 9 months ago

Download aclweb.org

We present a data-driven approach to learn user-adaptive referring expression generation (REG) policies for spoken dialogue systems. Referring expressions can be difficult to unde...

Srinivasan Janarthanam, Oliver Lemon

claim paper

Read More »

click to vote

GECCO
2009
Springer

200views Optimization» more GECCO 2009»

Apply ant colony optimization to Tetris

15 years 6 months ago

Download cs.nju.edu.cn

Tetris is a falling block game where the player’s objective is to arrange a sequence of diﬀerent shaped tetrominoes smoothly in order to survive. In the intelligence games, ag...

Xingguo Chen, Hao Wang, Weiwei Wang, Yinghuan Shi,...

claim paper

Read More »

click to vote

ACL
2010

161views Computational Linguistics» more ACL 2010»

Importance-Driven Turn-Bidding for Spoken Dialogue Systems

14 years 9 months ago

Download www.cse.ogi.edu

Current turn-taking approaches for spoken dialogue systems rely on the speaker releasing the turn before the other can take it. This reliance results in restricted interactions th...

Ethan Selfridge, Peter A. Heeman

claim paper

Read More »

click to vote

ROBOCUP
2000
Springer

130views Robotics» more ROBOCUP 2000»

Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition

15 years 3 months ago

Download www.er.ams.eng.osaka-u.ac.jp

Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...

Yasutake Takahashi, Masanori Takeda, Minoru Asada

claim paper

Read More »

click to vote

ICML
2009
IEEE

131views Machine Learning» more ICML 2009»

Monte-Carlo simulation balancing

16 years 12 days ago

Download www.cs.ualberta.ca

In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...

David Silver, Gerald Tesauro

claim paper

Read More »

« Prev « First page 20 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers