Search Sciweavers | Sciweavers

JMLR
2010

Efficient Reductions for Imitation Learning

14 years 4 months ago

Imitation Learning, while applied successfully on many large real-world problems, is typically addressed as a standard supervised learning problem, where it is assumed the trainin...

Stéphane Ross, Drew Bagnell

ACL
2008

Category: Computational Linguistics

Learning Effective Multimodal Dialogue Strategies from Wizard-of-Oz Data: Bootstrapping and Evaluation

14 years 11 months ago

Download www.aclweb.org

We address two problems in the field of automatic optimization of dialogue strategies: learning effective dialogue strategies when no initial data or system exists, and evaluating...

Verena Rieser, Oliver Lemon

SASO
2010
IEEE

Category: Control Systems

A Decentralised Architecture for Multi-objective Autonomic Management

14 years 7 months ago

Download adadiaconescu.there-you-are.com

Designing and organising large numbers of autonomic resources into a coherent system is a difficult endeavour. It necessitates handling complex interactions among dynamic, heteroge...

Sylvain Frey, Philippe Lalanda, Ada Diaconescu

KI
2007
Springer

Category: Artificial Intelligence

Options in Readylog Reloaded - Generating Decision-Theoretic Plan Libraries in Golog

15 years 4 months ago

Download www-kbsg.informatik.rwth-aachen.de

Readylog is a logic-based agent programming language and combines many important features from other Golog dialects. One of the features of Readylog is to make use of decision-theo...

Lutz Böhnstedt, Alexander Ferrein, Gerhard La...

CDC
2010
IEEE

Category: Control Systems

Pathologies of temporal difference methods in approximate dynamic programming

14 years 5 months ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas
