Search Sciweavers | Sciweavers

86 search results - page 9 / 18

» Evolution of reward functions for reinforcement learning

EWRL
2008

Markov Decision Processes with Arbitrary Reward Processes

Download www.cim.mcgill.ca

Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...

Jia Yuan Yu, Shie Mannor, Nahum Shimkin

WSC
2008

On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning

Download www.informs-sim.org

Reinforcement Learning (RL) is a simulation-based technique useful in solving Markov decision processes if their transition probabilities are not easily obtainable or if the probl...

Abhijit Gosavi

FLAIRS
2004

State Space Reduction For Hierarchical Reinforcement Learning

Download ranger.uta.edu

This paper provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as ε-reduction, ...

Mehran Asadi, Manfred Huber

ICML
2005
IEEE

Proto-value functions: developmental reinforcement learning

Download www.cs.umass.edu

This paper presents a novel framework called proto-reinforcement learning (PRL), based on a mathematical model of a proto-value function: these are task-independent basis function...

Sridhar Mahadevan

AROBOTS
1999

Reinforcement Learning Soccer Teams with Incomplete World Models

Download igitur-archive.library.uu.nl

We use reinforcement learning (RL) to compute strategies for multiagent soccer teams. RL may profit significantly from world models (WMs) estimating state transition probabilities an...

Marco Wiering, Rafal Salustowicz, Jürgen Schm...
