Search Sciweavers | Sciweavers

548 search results - page 17 / 110

» A New Way to Introduce Knowledge into Reinforcement Learning

144

Voted

AAAI
2012

205views Intelligent Agents» more AAAI 2012»

Kernel-Based Reinforcement Learning on Representative States

13 years 4 months ago

Download www.bkveton.com

Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...

Branislav Kveton, Georgios Theocharous

claim paper

Read More »

127

Voted

ISCA
2008
IEEE

137views Hardware» more ISCA 2008»

Self-Optimizing Memory Controllers: A Reinforcement Learning Approach

15 years 8 months ago

Download www.csl.cornell.edu

Eﬃciently utilizing oﬀ-chip DRAM bandwidth is a critical issue in designing cost-eﬀective, high-performance chip multiprocessors (CMPs). Conventional memory controllers deli...

Engin Ipek, Onur Mutlu, José F. Martí...

claim paper

Read More »

Voted

KDD
2010
ACM

282views Data Mining» more KDD 2010»

Optimizing debt collections using constrained reinforcement learning

15 years 5 months ago

Download www.prem-melville.com

In this paper, we propose and develop a novel approach to the problem of optimally managing the tax, and more generally debt, collections processes at ﬁnancial institutions. Our...

Naoki Abe, Prem Melville, Cezar Pendus, Chandan K....

claim paper

Read More »

111

click to vote

AGENTS
1999
Springer

105views Security Privacy» more AGENTS 1999»

Team-Partitioned, Opaque-Transition Reinforcement Learning

15 years 6 months ago

Download www.cs.ucf.edu

In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...

Peter Stone, Manuela M. Veloso

claim paper

Read More »

121

click to vote

JMLR
2008

124views more JMLR 2008»

Learning Control Knowledge for Forward Search Planning

15 years 1 months ago

Download web.engr.oregonstate.edu

A number of today's state-of-the-art planners are based on forward state-space search. The impressive performance can be attributed to progress in computing domain independen...

Sung Wook Yoon, Alan Fern, Robert Givan

claim paper

Read More »

« Prev « First page 17 / 110 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers