Search Sciweavers | Sciweavers

» Efficient Reinforcement Learning

WSC
2007

166views Modeling And Simulation» more WSC 2007»

Optimizing time warp simulation with reinforcement learning techniques

15 years 9 months ago

Download www.informs-sim.org

Adaptive Time Warp protocols in the literature are usually based on a pre-defined analytic model of the system, expressed as a closed form function that maps system state to cont...

Jun Wang, Carl Tropper

EWRL
2008

129views Machine Learning» more EWRL 2008»

Markov Decision Processes with Arbitrary Reward Processes

15 years 9 months ago

Download www.cim.mcgill.ca

Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...

Jia Yuan Yu, Shie Mannor, Nahum Shimkin

ICMLA
2003

159views Machine Learning» more ICMLA 2003»

A Distributed Reinforcement Learning Approach to Pattern Inference in Go

15 years 8 months ago

Download mysite.verizon.net

This paper shows that the distributed representation found in Learning Vector Quantization (LVQ) enables reinforcement learning methods to cope with a large decision search spa...

Myriam Abramson, Harry Wechsler

PKDD
2009
Springer

181views Data Mining» more PKDD 2009»

Active Learning for Reward Estimation in Inverse Reinforcement Learning

16 years 2 months ago

Download users.isr.ist.utl.pt

Abstract. Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...

Manuel Lopes, Francisco S. Melo, Luis Montesano

ICML
1998
IEEE

268views Machine Learning» more ICML 1998»

The MAXQ Method for Hierarchical Reinforcement Learning

16 years 8 months ago

Download www.cs.ualberta.ca

This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...

Thomas G. Dietterich
