Sciweavers

64 search results - page 11 / 13
» Multi-Agent Learning with Policy Prediction
Sort
View
LREC
2008
139views Education» more  LREC 2008»
13 years 7 months ago
Automatic Learning and Evaluation of User-Centered Objective Functions for Dialogue System Optimisation
The ultimate goal when building dialogue systems is to satisfy the needs of real users, but quality assurance for dialogue strategies is a non-trivial problem. The applied evaluat...
Verena Rieser, Oliver Lemon
PKDD
2009
Springer
152views Data Mining» more  PKDD 2009»
14 years 10 days ago
Feature Selection for Value Function Approximation Using Bayesian Model Selection
Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of th...
Tobias Jung, Peter Stone
PPOPP
2009
ACM
14 years 6 months ago
Mapping parallelism to multi-cores: a machine learning based approach
The efficient mapping of program parallelism to multi-core processors is highly dependent on the underlying architecture. This paper proposes a portable and automatic compiler-bas...
Zheng Wang, Michael F. P. O'Boyle
ICML
2005
IEEE
14 years 6 months ago
High speed obstacle avoidance using monocular vision and reinforcement learning
We consider the task of driving a remote control car at high speeds through unstructured outdoor environments. We present an approach in which supervised learning is first used to...
Jeff Michels, Ashutosh Saxena, Andrew Y. Ng
ABIALS
2008
Springer
13 years 7 months ago
Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning
Abstract. In order to establish autonomous behavior for technical systems, the well known trade-off between reactive control and deliberative planning has to be considered. Within ...
Matthias Rungger, Hao Ding, Olaf Stursberg