Search Sciweavers | Sciweavers

3 search results - page 1 / 1

AIPS
2010

174 views, Artificial Intelligence

When Policies Can Be Trusted: Analyzing a Criteria to Identify Optimal Policies in MDPs with Unknown Model Parameters

13 years 7 months ago

Download www.cs.berkeley.edu

Computing a good policy in stochastic uncertain environments with unknown dynamics and reward model parameters is a challenging task. In a number of domains, ranging from space ro...

Emma Brunskill

LION
2007
Springer

192 views, Optimization

Learning While Optimizing an Unknown Fitness Surface

13 years 11 months ago

Download www.science.unitn.it

This paper is about Reinforcement Learning (RL) applied to online parameter tuning in Stochastic Local Search (SLS) methods. In particular a novel application of RL is considered i...

Roberto Battiti, Mauro Brunato, Paolo Campigotto

NIPS
1998

137 views, Information Technology

Risk Sensitive Reinforcement Learning

13 years 6 months ago

Download www.cs.cmu.edu

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch
