Sciweavers

651 search results - page 95 / 131
» Algorithms for Inverse Reinforcement Learning
Sort
View
AROBOTS
2008
131views more  AROBOTS 2008»
14 years 9 months ago
Active audition using the parameter-less self-organising map
This paper presents a novel method for enabling a robot to determine the position of a sound source in three dimensions using just two microphones and interaction with its environm...
Erik Berglund, Joaquin Sitte, Gordon Wyeth
CORR
2006
Springer
140views Education» more  CORR 2006»
14 years 9 months ago
Nearly optimal exploration-exploitation decision thresholds
While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds ...
Christos Dimitrakakis
SIGDIAL
2010
14 years 7 months ago
Sparse Approximate Dynamic Programming for Dialog Management
Spoken dialogue management strategy optimization by means of Reinforcement Learning (RL) is now part of the state of the art. Yet, there is still a clear mismatch between the comp...
Senthilkumar Chandramohan, Matthieu Geist, Olivier...
FOCS
1994
IEEE
15 years 1 months ago
The Power of Team Exploration: Two Robots Can Learn Unlabeled Directed Graphs
We show that two cooperating robots can learn exactly any strongly-connected directed graph with n indistinguishable nodes in expected time polynomial in n. We introduce a new typ...
Michael A. Bender, Donna K. Slonim
ICML
2004
IEEE
15 years 10 months ago
Multi-task feature and kernel selection for SVMs
We compute a common feature selection or kernel selection configuration for multiple support vector machines (SVMs) trained on different yet inter-related datasets. The method is ...
Tony Jebara