Sciweavers

3523 search results - page 88 / 705
» Feature interaction in policies
Sort
View
JMLR
2010
101views more  JMLR 2010»
14 years 4 months ago
Efficient Reductions for Imitation Learning
Imitation Learning, while applied successfully on many large real-world problems, is typically addressed as a standard supervised learning problem, where it is assumed the trainin...
Stéphane Ross, Drew Bagnell
ACL
2008
14 years 11 months ago
Learning Effective Multimodal Dialogue Strategies from Wizard-of-Oz Data: Bootstrapping and Evaluation
We address two problems in the field of automatic optimization of dialogue strategies: learning effective dialogue strategies when no initial data or system exists, and evaluating...
Verena Rieser, Oliver Lemon
SASO
2010
IEEE
14 years 7 months ago
A Decentralised Architecture for Multi-objective Autonomic Management
Designing and organising large numbers of autonomic resources into a coherent system is a difficult endeavour. It necessitates handling complex interactions among dynamic, heteroge...
Sylvain Frey, Philippe Lalanda, Ada Diaconescu
KI
2007
Springer
15 years 4 months ago
Options in Readylog Reloaded - Generating Decision-Theoretic Plan Libraries in Golog
Readylog is a logic-based agent programming language and combines many important features from other Golog dialects. One of the features of Readylog is to make use of decision-theo...
Lutz Böhnstedt, Alexander Ferrein, Gerhard La...
CDC
2010
IEEE
136views Control Systems» more  CDC 2010»
14 years 5 months ago
Pathologies of temporal difference methods in approximate dynamic programming
Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...
Dimitri P. Bertsekas