Search Sciweavers | Sciweavers

162 search results - page 10 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...

179

ICASSP
2008
IEEE

215views Signal Processing» more ICASSP 2008»

Multimodal information fusion using the iterative decoding algorithm and its application to audio-visual speech recognition

16 years 14 days ago

Download www.itr-rescue.org

The fusion of information from heterogenous sensors is crucial to the effectiveness of a multimodal system. Noise affect the sensors of different modalities independently. A good ...

Shankar T. Shivappa, Bhaskar D. Rao, Mohan M. Triv...

claim paper

Read More »

168

click to vote

ISCC
2000
IEEE

104views Communications» more ISCC 2000»

Dynamic Routing and Wavelength Assignment Using First Policy Iteration

15 years 10 months ago

Download www.netlab.tkk.fi

With standard assumptions the routing and wavelength assignment problem (RWA) can be viewed as a Markov Decision Process (MDP). The problem, however, deﬁes an exact solution bec...

Esa Hyytiä, Jorma T. Virtamo

claim paper

Read More »

185

click to vote

CORR
2006
Springer

113views Education» more CORR 2006»

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

15 years 6 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(), LSTD()...

Manuel Loth, Philippe Preux

claim paper

Read More »

160

click to vote

KI
2007
Springer

136views Artificial Intelligence» more KI 2007»

Solving Decentralized Continuous Markov Decision Problems with Structured Reward

15 years 5 months ago

Download juban.free.fr

We present an approximation method that solves a class of Decentralized hybrid Markov Decision Processes (DEC-HMDPs). These DEC-HMDPs have both discrete and continuous state variab...

Emmanuel Benazera

claim paper

Read More »

154

click to vote

TOMACS
2010

79views more TOMACS 2010»

A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

15 years 22 days ago

Download legacy.orie.cornell.edu

In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...

Sumit Kunnumkal, Huseyin Topaloglu

claim paper

Read More »

« Prev « First page 10 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers