Search Sciweavers | Sciweavers

473 search results - page 27 / 95

» Optimal policy switching algorithms for reinforcement learni...

100

click to vote

ILP
2007
Springer

250views Automated Reasoning» more ILP 2007»

Learning Relational Options for Inductive Transfer in Relational Reinforcement Learning

15 years 6 months ago

Download people.cs.kuleuven.be

In reinforcement learning problems, an agent has the task of learning a good or optimal strategy from interaction with his environment. At the start of the learning task, the agent...

Tom Croonenborghs, Kurt Driessens, Maurice Bruynoo...

claim paper

Read More »

107

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 19 days ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 19 days ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

click to vote

NIPS
2003

108views Information Technology» more NIPS 2003»

Policy Search by Dynamic Programming

15 years 1 months ago

Download books.nips.cc

We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...

J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...

claim paper

Read More »

click to vote

EURONGI
2005
Springer

115views Computer Networks» more EURONGI 2005»

An Afterstates Reinforcement Learning Approach to Optimize Admission Control in Mobile Cellular Networks

15 years 5 months ago

Download jogiguz.webs.upv.es

We deploy a novel Reinforcement Learning optimization technique based on afterstates learning to determine the gain that can be achieved by incorporating movement prediction inform...

José Manuel Giménez-Guzmán, J...

claim paper

Read More »

« Prev « First page 27 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers