Search Sciweavers | Sciweavers

86 search results - page 10 / 18

» Evolution of reward functions for reinforcement learning

102

click to vote

ICML
2003
IEEE

105views Machine Learning» more ICML 2003»

Principled Methods for Advising Reinforcement Learning Agents

16 years 2 months ago

Download www.hpl.hp.com

An important issue in reinforcement learning is how to incorporate expert knowledge in a principled manner, especially as we scale up to real-world tasks. In this paper, we presen...

Eric Wiewiora, Garrison W. Cottrell, Charles Elkan

claim paper

Read More »

105

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 2 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

120

click to vote

ICML
1999
IEEE

152views Machine Learning» more ICML 1999»

Distributed Value Functions

16 years 2 months ago

Download www.ri.cmu.edu

Many interesting problems, such as power grids, network switches, and tra c ow, that are candidates for solving with reinforcement learningRL, alsohave properties that make distri...

Jeff G. Schneider, Weng-Keen Wong, Andrew W. Moore...

claim paper

Read More »

100

click to vote

IJCAI
2007

156views Artificial Intelligence» more IJCAI 2007»

Online Learning and Exploiting Relational Models in Reinforcement Learning

15 years 2 months ago

Download www.ijcai.org

In recent years, there has been a growing interest in using rich representations such as relational languages for reinforcement learning. However, while expressive languages have ...

Tom Croonenborghs, Jan Ramon, Hendrik Blockeel, Ma...

claim paper

Read More »

click to vote

AIPS
2006

141views Artificial Intelligence» more AIPS 2006»

Combining Stochastic Task Models with Reinforcement Learning for Dynamic Scheduling

15 years 2 months ago

Download www.aaai.org

We view dynamic scheduling as a sequential decision problem. Firstly, we introduce a generalized planning operator, the stochastic task model (STM), which predicts the effects of ...

Malcolm J. A. Strens

claim paper

Read More »

« Prev « First page 10 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers