Search Sciweavers | Sciweavers

651 search results - page 92 / 131

» Algorithms for Inverse Reinforcement Learning

112

click to vote

ICML
2009
IEEE

131views Machine Learning» more ICML 2009»

Monte-Carlo simulation balancing

16 years 3 months ago

Download www.cs.ualberta.ca

In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...

David Silver, Gerald Tesauro

claim paper

Read More »

112

click to vote

ACL
2010

135views Computational Linguistics» more ACL 2010»

Reading between the Lines: Learning to Map High-Level Instructions to Commands

15 years 1 months ago

Download ai.cs.washington.edu

In this paper, we address the task of mapping high-level instructions to sequences of commands in an external environment. Processing these instructions is challenging--they posit...

S. R. K. Branavan, Luke S. Zettlemoyer, Regina Bar...

claim paper

Read More »

138

click to vote

ACL
1998

129views Computational Linguistics» more ACL 1998»

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for Email

15 years 4 months ago

Download acl.eldoc.ub.rug.nl

This paper describes a novel method by which a dialogue agent can learn to choose an optimal dialogue strategy. While it is widely agreed that dialogue strategies should be formul...

Marilyn A. Walker, Jeanne Frommer, Shrikanth Naray...

claim paper

Read More »

112

click to vote

IIE
2007

63views more IIE 2007»

Investigation of Q-Learning in the Context of a Virtual Learning Environment

15 years 3 months ago

Download www.mii.lt

We investigate the possibility to apply a known machine learning algorithm of Q-learning in the domain of a Virtual Learning Environment (VLE). It is important in this problem doma...

Dalia Baziukaite

claim paper

Read More »

129

click to vote

ALIFE
2002

176views Modeling And Simulation» more ALIFE 2002»

Ant Colony Optimization and Stochastic Gradient Descent

15 years 2 months ago

Download ti.arc.nasa.gov

In this paper, we study the relationship between the two techniques known as ant colony optimization (aco) and stochastic gradient descent. More precisely, we show that some empir...

Nicolas Meuleau, Marco Dorigo

claim paper

Read More »

« Prev « First page 92 / 131 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers