Search Sciweavers | Sciweavers

121 search results - page 9 / 25

» Learning Decision Theoretic Utilities through Reinforcement ...

133

click to vote

PRIMA
2009
Springer

102views Intelligent Agents» more PRIMA 2009»

Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

16 years 8 days ago

Download teamcore.usc.edu

In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...

Itsuki Noda

claim paper

Read More »

156

click to vote

CLA
2007

251views Artificial Intelligence» more CLA 2007»

Policies Generalization in Reinforcement Learning using Galois Partitions Lattices

15 years 7 months ago

Download sunsite.informatik.rwth-aachen.de

The generalization of policies in reinforcement learning is a main issue, both from the theoretical model point of view and for their applicability. However, generalizing from a se...

Marc Ricordeau, Michel Liquiere

claim paper

Read More »

256

click to vote

ILP
2007
Springer

283views Automated Reasoning» more ILP 2007»

Building Relational World Models for Reinforcement Learning

15 years 12 months ago

Download ftp.cs.wisc.edu

Abstract. Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...

Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...

claim paper

Read More »

159

click to vote

AAAI
2008

199views Intelligent Agents» more AAAI 2008»

Maximum Entropy Inverse Reinforcement Learning

15 years 8 months ago

Download www.andrew.cmu.edu

Recent research has shown the benefit of framing problems of imitation learning as solutions to Markov Decision Problems. This approach reduces learning to the problem of recoveri...

Brian Ziebart, Andrew L. Maas, J. Andrew Bagnell, ...

claim paper

Read More »

170

click to vote

ISCA
2008
IEEE

137views Hardware» more ISCA 2008»

Self-Optimizing Memory Controllers: A Reinforcement Learning Approach

16 years 4 days ago

Download www.csl.cornell.edu

Eﬃciently utilizing oﬀ-chip DRAM bandwidth is a critical issue in designing cost-eﬀective, high-performance chip multiprocessors (CMPs). Conventional memory controllers deli...

Engin Ipek, Onur Mutlu, José F. Martí...

claim paper

Read More »

« Prev « First page 9 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers