Search Sciweavers | Sciweavers

771 search results - page 37 / 155

» Markov Decision Processes with Arbitrary Reward Processes

CORR
2006
Springer

113 views, Education

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

15 years 5 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ)...

Manuel Loth, Philippe Preux

CSL
2012
Springer

311 views, Automated Reasoning

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

14 years 1 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurčíček, Blaise Thomson, Steve Young

QEST
2010
IEEE

154 views, Modeling and Simulation

Symblicit Calculation of Long-Run Averages for Concurrent Probabilistic Systems

15 years 3 months ago

Download www.informatik.uni-freiburg.de

Model checkers for concurrent probabilistic systems have become very popular within the last decade. The study of long-run average behavior has, however, received only scan...

Ralf Wimmer, Bettina Braitling, Bernd Becker, Erns...

FLAIRS
2004

140 views, Artificial Intelligence

State Space Reduction For Hierarchical Reinforcement Learning

15 years 7 months ago

Download ranger.uta.edu

This paper provides new techniques for abstracting the state space of a Markov Decision Process (MDP). These techniques extend one of the recent minimization models, known as ε-reduction, ...

Mehran Asadi, Manfred Huber

IPPS
2000
IEEE

115 views, Distributed And Parallel Com...

A Decision-Process Analysis of Implicit Coscheduling

15 years 10 months ago

Download www.cs.umd.edu

This paper presents a theoretical framework based on Bayesian decision theory for analyzing recently reported results on implicit coscheduling of parallel applications on clusters of work...

Radha Poovendran, Peter J. Keleher, John S. Baras
