Search Sciweavers | Sciweavers

215 search results - page 19 / 43

» Model-Based Reinforcement Learning with Continuous States an...

125

Voted

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

15 years 10 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

click to vote

NIPS
1996

117views Information Technology» more NIPS 1996»

Reinforcement Learning for Mixed Open-loop and Closed-loop Control

15 years 2 months ago

Download anytime.cs.umass.edu

Closed-loop control relies on sensory feedback that is usually assumed to be free. But if sensing incurs a cost, it may be coste ective to take sequences of actions in open-loop m...

Eric A. Hansen, Andrew G. Barto, Shlomo Zilberstei...

claim paper

Read More »

122

click to vote

ICML
2005
IEEE

93views Machine Learning» more ICML 2005»

Relating reinforcement learning performance to classification performance

16 years 2 months ago

Download hunch.net

We prove a quantitative connection between the expected sum of rewards of a policy and binary classification performance on created subproblems. This connection holds without any ...

John Langford, Bianca Zadrozny

claim paper

Read More »

133

click to vote

GECCO
2007
Springer

179views Optimization» more GECCO 2007»

XCSF with computed continuous action

15 years 7 months ago

Download www.cs.bham.ac.uk

Wilson introduced XCSF as a successor to XCS. The major development of XCSF is the concept of a computed prediction. The efficiency of XCSF in dealing with numerical input and con...

Trung Hau Tran, Cédric Sanza, Yves Duthen, ...

claim paper

Read More »

111

click to vote

JUCS
2007

98views more JUCS 2007»

Focus of Attention in Reinforcement Learning

15 years 1 months ago

Download www.research.rutgers.edu

Abstract: Classiﬁcation-based reinforcement learning (RL) methods have recently been proposed as an alternative to the traditional value-function based methods. These methods use...

Lihong Li, Vadim Bulitko, Russell Greiner

claim paper

Read More »

« Prev « First page 19 / 43 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers