Search Sciweavers | Sciweavers

210 search results - page 10 / 42

» An analysis of reinforcement learning with function approxim...

ECML
2004
Springer

112 views, Machine Learning

Convergence and Divergence in Standard and Averaging Reinforcement Learning

15 years 5 months ago

Download igitur-archive.library.uu.nl

Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...

Marco Wiering

GECCO
2004
Springer

122 views, Optimization

Gradient-Based Learning Updates Improve XCS Performance in Multistep Problems

15 years 5 months ago

Download www.cs.york.ac.uk

This paper introduces a gradient-based reward prediction update mechanism to the XCS classifier system as applied in neural-network type learning and function approximation mechani...

Martin V. Butz, David E. Goldberg, Pier Luca Lanzi

ICRA
2009
IEEE

259 views, Robotics

Constructing action set from basis functions for reinforcement learning of robot control

15 years 6 months ago

Download robotics.aist-nara.ac.jp

Continuous action sets are used in many reinforcement learning (RL) applications in robot control since the control input is continuous. However, discrete action sets a...

Akihiko Yamaguchi, Jun Takamatsu, Tsukasa Ogasawar...

PKDD
2009
Springer

144 views, Data Mining

Compositional Models for Reinforcement Learning

15 years 6 months ago

Download userweb.cs.utexas.edu

Innovations such as optimistic exploration, function approximation, and hierarchical decomposition have helped scale reinforcement learning to more complex environments, ...

Nicholas K. Jong, Peter Stone

AAAI
2008

207 views, Intelligent Agents

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

15 years 2 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...
