Search Sciweavers | Sciweavers

2566 search results - page 31 / 514

» Relating reinforcement learning performance to classificatio...

148

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 7 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

150

click to vote

EURONGI
2005
Springer

115views Computer Networks» more EURONGI 2005»

An Afterstates Reinforcement Learning Approach to Optimize Admission Control in Mobile Cellular Networks

15 years 11 months ago

Download jogiguz.webs.upv.es

We deploy a novel Reinforcement Learning optimization technique based on afterstates learning to determine the gain that can be achieved by incorporating movement prediction inform...

José Manuel Giménez-Guzmán, J...

claim paper

Read More »

146

click to vote

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 7 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

134

click to vote

IJCNN
2008
IEEE

113views Neural Networks» more IJCNN 2008»

Uncertainty propagation for quality assurance in Reinforcement Learning

15 years 12 months ago

Download www.inb.uni-luebeck.de

— In this paper we address the reliability of policies derived by Reinforcement Learning on a limited amount of observations. This can be done in a principled manner by taking in...

Daniel Schneegaß, Steffen Udluft, Thomas Mar...

claim paper

Read More »

180

click to vote

FLAIRS
2004

146views Artificial Intelligence» more FLAIRS 2004»

Developing Task Specific Sensing Strategies Using Reinforcement Learning

15 years 7 months ago

Download ranger.uta.edu

Robots that can adapt and perform multiple tasks promise to be a powerful tool with many applications. In order to achieve such robots, control systems have to be constructed that...

Srividhya Rajendran, Manfred Huber

claim paper

Read More »

« Prev « First page 31 / 514 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers