Sciweavers

2005 search results - page 260 / 401
Decisive Markov Chains
GLOBECOM
2006
IEEE
15 years 7 months ago
Dynamic Wavelength Sharing Policies for Absolute QoS in OBS Networks
We consider the problem of providing absolute QoS guarantees to multiple classes of users of an OBS network in terms of the end-to-end burst loss. We employ Markov decision pro...
Li Yang, George N. Rouskas
ICTAI
2006
IEEE
15 years 7 months ago
A New Hybrid GA-MDP Algorithm For The Frequency Assignment Problem
We propose a novel algorithm called GA-MDP for solving the frequency assignment problem. GA-MDP inherits the spirit of genetic algorithms with an adaptation of Markov Decision Proc...
Lhassane Idoumghar, René Schott
NAACL
2007
15 years 3 months ago
Comparing User Simulation Models For Dialog Strategy Learning
This paper explores what kind of user simulation model is suitable for developing a training corpus for using Markov Decision Processes (MDPs) to automatically learn dialog strate...
Hua Ai, Joel R. Tetreault, Diane J. Litman
NIPS
2007
15 years 3 months ago
Online Linear Regression and Its Application to Model-Based Reinforcement Learning
We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...
Alexander L. Strehl, Michael L. Littman
UAI
2000
15 years 2 months ago
Approximately Optimal Monitoring of Plan Preconditions
Monitoring plan preconditions can allow for replanning when a precondition fails, generally far in advance of the point in the plan where the precondition is relevant. However, mo...
Craig Boutilier