Search Sciweavers | Sciweavers

1233 search results - page 114 / 247

» Reinforcement learning

190

click to vote

ICML
2010
IEEE

171views Machine Learning» more ICML 2010»

Efficient Reinforcement Learning with Multiple Reward Functions for Randomized Controlled Trial Analysis

15 years 8 months ago

Download www.stat.lsa.umich.edu

We introduce new, efficient algorithms for value iteration with multiple reward functions and continuous state. We also give an algorithm for finding the set of all nondominated a...

Daniel J. Lizotte, Michael H. Bowling, Susan A. Mu...

claim paper

Read More »

215

click to vote

NN
2006
Springer

72views Neural Networks» more NN 2006»

Neural systems implicated in delayed and probabilistic reinforcement

15 years 7 months ago

Download egret.psychol.cam.ac.uk

This review considers the theoretical problems facing agents that must learn and choose on the basis of reward or reinforcement that is uncertain or delayed, in implicit or proced...

Rudolf N. Cardinal

claim paper

Read More »

195

click to vote

FLAIRS
2004

140views Artificial Intelligence» more FLAIRS 2004»

State Space Reduction For Hierarchical Reinforcement Learning

15 years 8 months ago

Download ranger.uta.edu

er provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as -reduction, ...

Mehran Asadi, Manfred Huber

claim paper

Read More »

191

click to vote

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

16 years 8 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

196

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 8 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

« Prev « First page 114 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers