Search Sciweavers | Sciweavers

1233 search results - page 92 / 247

» Reinforcement learning

182

Voted

CORR
2011
Springer

194views Education» more CORR 2011»

Accelerating Reinforcement Learning through Implicit Imitation

14 years 10 months ago

Download www.aaai.org

Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an agent’s ability to learn useful behaviors by making intelligent use of the kn...

Craig Boutilier, Bob Price

claim paper

Read More »

175

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

175

click to vote

ICML
2004
IEEE

145views Machine Learning» more ICML 2004»

Convergence of synchronous reinforcement learning with linear function approximation

16 years 7 months ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

claim paper

Read More »

185

Voted

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

184

click to vote

ISCA
2008
IEEE

137views Hardware» more ISCA 2008»

Self-Optimizing Memory Controllers: A Reinforcement Learning Approach

16 years 1 months ago

Download www.csl.cornell.edu

Eﬃciently utilizing oﬀ-chip DRAM bandwidth is a critical issue in designing cost-eﬀective, high-performance chip multiprocessors (CMPs). Conventional memory controllers deli...

Engin Ipek, Onur Mutlu, José F. Martí...

claim paper

Read More »

« Prev « First page 92 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers