Search Sciweavers | Sciweavers

140 search results - page 11 / 28

» Structural Abstraction Experiments in Reinforcement Learning

108

Voted

PKDD
2009
Springer

184views Data Mining» more PKDD 2009»

Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm

15 years 4 months ago

Download www.lri.fr

Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...

Philippe Rolet, Michèle Sebag, Olivier Teyt...

claim paper

Read More »

click to vote

FBIT
2007
IEEE

142views Information Technology» more FBIT 2007»

Learning to Drive a Real Car in 20 Minutes

15 years 6 months ago

Download www.ni.uos.de

The paper describes our ﬁrst experiments on Reinforcement Learning to steer a real robot car. The applied method, Neural Fitted Q Iteration (NFQ) is purely data-driven based on ...

Martin Riedmiller, Michael Montemerlo, Hendrik Dah...

claim paper

Read More »

click to vote

IROS
2008
IEEE

165views Robotics» more IROS 2008»

Mutual development of behavior acquisition and recognition based on value system

15 years 6 months ago

Download www.er.ams.eng.osaka-u.ac.jp

Abstract. Both self-learning architecture (embedded structure) and explicit/implicit teaching from other agents (environmental design issue) are necessary not only for one behavior...

Yasutake Takahashi, Yoshihiro Tamura, Minoru Asada

claim paper

Read More »

click to vote

ECML
2007
Springer

133views Machine Learning» more ECML 2007»

Structure Learning of Probabilistic Relational Models from Incomplete Relational Data

15 years 5 months ago

Download cs.nju.edu.cn

Abstract. Existing relational learning approaches usually work on complete relational data, but real-world data are often incomplete. This paper proposes the MGDA approach to learn...

Xiao-Lin Li, Zhi-Hua Zhou

claim paper

Read More »

126

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 6 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

« Prev « First page 11 / 28 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers