Search Sciweavers | Sciweavers

18

IJCAI
2001

163views Artificial Intelligence» more IJCAI 2001»

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

13 years 6 months ago

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

14

click to vote

FLAIRS
2003

141views Artificial Intelligence» more FLAIRS 2003»

Learning from Reinforcement and Advice Using Composite Reward Functions

13 years 6 months ago

Download ranger.uta.edu

1 Reinforcement learning has become a widely used methodology for creating intelligent agents in a wide range of applications. However, its performance deteriorates in tasks with s...

Vinay N. Papudesi, Manfred Huber

claim paper

Read More »

16

click to vote

MDAI
2005
Springer

138views Artificial Intelligence» more MDAI 2005»

Perceptive Evaluation for the Optimal Discounted Reward in Markov Decision Processes

13 years 11 months ago

Download www.math.s.chiba-u.ac.jp

We formulate a fuzzy perceptive model for Markov decision processes with discounted payoﬀ in which the perception for transition probabilities is described by fuzzy sets. Our aim...

Masami Kurano, Masami Yasuda, Jun-ichi Nakagami, Y...

claim paper

Read More »

18

click to vote

ALT
2007
Springer

119views Machine Learning» more ALT 2007»

Pseudometrics for State Aggregation in Average Reward Markov Decision Processes

14 years 2 months ago

Download personal.unileoben.ac.at

We consider how state similarity in average reward Markov decision processes (MDPs) may be described by pseudometrics. Introducing the notion of adequate pseudometrics which are we...

Ronald Ortner

claim paper

Read More »

21

click to vote

COLT
2007
Springer

143views Machine Learning» more COLT 2007»

Bounded Parameter Markov Decision Processes with Average Reward Criterion

13 years 11 months ago

Download ttic.uchicago.edu

Bounded parameter Markov Decision Processes (BMDPs) address the issue of dealing with uncertainty in the parameters of a Markov Decision Process (MDP). Unlike the case of an MDP, t...

Ambuj Tewari, Peter L. Bartlett

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers