Search Sciweavers | Sciweavers

536 search results - page 18 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

click to vote

PKDD
2009
Springer

181views Data Mining» more PKDD 2009»

Active Learning for Reward Estimation in Inverse Reinforcement Learning

15 years 6 months ago

Download users.isr.ist.utl.pt

Abstract. Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...

Manuel Lopes, Francisco S. Melo, Luis Montesano

claim paper

Read More »

click to vote

ATAL
2010
Springer

181views Intelligent Agents» more ATAL 2010»

Basis function construction for hierarchical reinforcement learning

15 years 23 days ago

Download www.cs.brown.edu

This paper introduces an approach to automatic basis function construction for Hierarchical Reinforcement Learning (HRL) tasks. We describe some considerations that arise when con...

Sarah Osentoski, Sridhar Mahadevan

claim paper

Read More »

100

click to vote

ECAI
2010
Springer

211views Artificial Intelligence» more ECAI 2010»

Case-Based Multiagent Reinforcement Learning: Cases as Heuristics for Selection of Actions

15 years 23 days ago

Download www.iiia.csic.es

This work presents a new approach that allows the use of cases in a case base as heuristics to speed up Multiagent Reinforcement Learning algorithms, combining Case-Based Reasoning...

Reinaldo A. C. Bianchi, Ramon López de M&aa...

claim paper

Read More »

112

Voted

ICML
1998
IEEE

179views Machine Learning» more ICML 1998»

Value Function Based Production Scheduling

16 years 14 days ago

Download www.ri.cmu.edu

Production scheduling, the problem of sequentially con guring a factory to meet forecasted demands, is a critical problem throughout the manufacturing industry. The requirement of...

Jeff G. Schneider, Justin A. Boyan, Andrew W. Moor...

claim paper

Read More »

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 4 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

« Prev « First page 18 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers