Sciweavers

140 search results - page 11 / 28
» Structural Abstraction Experiments in Reinforcement Learning
PKDD
2009
Springer
15 years 8 months ago
Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm
Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...
Philippe Rolet, Michèle Sebag, Olivier Teyt...
FBIT
2007
IEEE
15 years 10 months ago
Learning to Drive a Real Car in 20 Minutes
The paper describes our first experiments on Reinforcement Learning to steer a real robot car. The applied method, Neural Fitted Q Iteration (NFQ), is purely data-driven based on ...
Martin Riedmiller, Michael Montemerlo, Hendrik Dah...
IROS
2008
IEEE
15 years 10 months ago
Mutual development of behavior acquisition and recognition based on value system
Abstract. Both self-learning architecture (embedded structure) and explicit/implicit teaching from other agents (environmental design issue) are necessary not only for one behavior...
Yasutake Takahashi, Yoshihiro Tamura, Minoru Asada
ECML
2007
Springer
15 years 10 months ago
Structure Learning of Probabilistic Relational Models from Incomplete Relational Data
Abstract. Existing relational learning approaches usually work on complete relational data, but real-world data are often incomplete. This paper proposes the MGDA approach to learn...
Xiao-Lin Li, Zhi-Hua Zhou
JMLR
2010
14 years 10 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...