Search Sciweavers | Sciweavers

23 search results - page 3 / 5

NIPS
2001

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 29 days ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

NIPS
2007

Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning

15 years 1 months ago

Download books.nips.cc

Electrical power management in large-scale IT systems such as commercial datacenters is an application area of rapidly growing interest from both an economic and ecological perspe...

Gerald Tesauro, Rajarshi Das, Hoi Chan, Jeffrey O....

IJCNN
2008
IEEE

Uncertainty propagation for quality assurance in Reinforcement Learning

15 years 6 months ago

Download www.inb.uni-luebeck.de

In this paper we address the reliability of policies derived by Reinforcement Learning on a limited amount of observations. This can be done in a principled manner by taking in...

Daniel Schneegaß, Steffen Udluft, Thomas Mar...

ICML
1994
IEEE

A Modular Q-Learning Architecture for Manipulator Task Decomposition

15 years 3 months ago

Download mi.eng.cam.ac.uk

Compositional Q-Learning (CQ-L) (Singh 1992) is a modular approach to learning to perform composite tasks made up of several elemental tasks by reinforcement learning. Skills acqui...

Chen K. Tham, Richard W. Prager

JAIR
2000

An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email

14 years 11 months ago

Download www.jair.org

This paper describes a novel method by which a spoken dialogue system can learn to choose an optimal dialogue strategy from its experience interacting with human users. The method...

Marilyn A. Walker
