Search Sciweavers | Sciweavers

283 search results - page 14 / 57

» Abstracting Reusable Cases from Reinforcement Learning

182

ML
2008
ACM

152views Machine Learning» more ML 2008»

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 5 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, R&ea...

claim paper

Read More »

118

click to vote

ICCBR
2005
Springer

91views Automated Reasoning» more ICCBR 2005»

Opportunities for CBR in Learning by Doing

15 years 11 months ago

Download gaia.fdi.ucm.es

In this paper we partially describe JV2 M, a metaphorical simulation of the Java Virtual Machine where students can learn Java language compilation and reinforce object-oriented pr...

Pedro Pablo Gómez-Martín, Marco Anto...

claim paper

Read More »

144

click to vote

IJCAI
2007

140views Artificial Intelligence» more IJCAI 2007»

Utile Distinctions for Relational Reinforcement Learning

15 years 6 months ago

Download www.ijcai.org

We introduce an approach to autonomously creating state space abstractions for an online reinforcement learning agent using a relational representation. Our approach uses a tree-b...

William Dabney, Amy McGovern

claim paper

Read More »

155

click to vote

ICCBR
2010
Springer

274views Automated Reasoning» more ICCBR 2010»

Reducing the Memory Footprint of Temporal Difference Learning over Finitely Many States by Using Case-Based Generalization

15 years 9 months ago

Download www.cse.lehigh.edu

In this paper we present an approach for reducing the memory footprint requirement of temporal difference methods in which the set of states is finite. We use case-based generaliza...

Matt Dilts, Héctor Muñoz-Avila

claim paper

Read More »

178

click to vote

DIS
2009
Springer

121views Theoretical Computer Science» more DIS 2009»

OMFP: An Approach for Online Mass Flow Prediction in CFB Boilers

16 years 5 hour ago

Download www.win.tue.nl

Abstract. Fuel feeding and inhomogeneity of fuel typically cause process ﬂuctuations in the circulating ﬂuidized bed (CFB) boilers. If control systems fail to compensate the �...

Indre Zliobaite, Jorn Bakker, Mykola Pechenizkiy

claim paper

Read More »

« Prev « First page 14 / 57 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers