Search Sciweavers | Sciweavers

286 search results - page 1 / 58

» Using inaccurate models in reinforcement learning

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 10 days ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

click to vote

ATAL
2009
Springer

137views Intelligent Agents» more ATAL 2009»

Generalized model learning for reinforcement learning in factored domains

15 years 6 months ago

Download userweb.cs.utexas.edu

Improving the sample eﬃciency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-ba...

Todd Hester, Peter Stone

claim paper

Read More »

105

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

15 years 27 days ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

click to vote

CORR
1998
Springer

164views Education» more CORR 1998»

Training Reinforcement Neurocontrollers Using the Polytope Algorithm

14 years 11 months ago

Download zeus.cs.uoi.gr

A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...

Aristidis Likas, Isaac E. Lagaris

claim paper

Read More »

113

click to vote

SAC
2005
ACM

149views Applied Computing» more SAC 2005»

Reinforcement learning agents with primary knowledge designed by analytic hierarchy process

15 years 5 months ago

Download k2x.ice.ous.ac.jp

This paper presents a novel model of reinforcement learning agents. A feature of our learning agent model is to integrate analytic hierarchy process (AHP) into a standard reinforc...

Kengo Katayama, Takahiro Koshiishi, Hiroyuki Narih...

claim paper

Read More »

« Prev « First page 1 / 58 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers