Sciweavers

771 search results - page 49 / 155
» Markov Decision Processes with Arbitrary Reward Processes
Sort
View
FGR
2006
IEEE
205views Biometrics» more  FGR 2006»
15 years 8 months ago
Tracking Using Dynamic Programming for Appearance-Based Sign Language Recognition
We present a novel tracking algorithm that uses dynamic programming to determine the path of target objects and that is able to track an arbitrary number of different objects. The...
Philippe Dreuw, Thomas Deselaers, David Rybach, Da...
IJCAI
2001
15 years 3 months ago
An Improved Grid-Based Approximation Algorithm for POMDPs
Although a partially observable Markov decision process (POMDP) provides an appealing model for problems of planning under uncertainty, exact algorithms for POMDPs are intractable...
Rong Zhou, Eric A. Hansen
IWQOS
2011
Springer
14 years 4 months ago
An MDP-based admission control for a QoS-aware service-oriented system
In this paper, we address the problem of providing a service broker, which offers to prospective users a composite service with a range of different Quality of Service (QoS) class...
Marco Abundo, Valeria Cardellini, Francesco Lo Pre...
AIPS
2007
15 years 4 months ago
Prioritizing Bellman Backups without a Priority Queue
Several researchers have shown that the efficiency of value iteration, a dynamic programming algorithm for Markov decision processes, can be improved by prioritizing the order of...
Peng Dai, Eric A. Hansen
AAAI
2012
13 years 4 months ago
Kernel-Based Reinforcement Learning on Representative States
Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...
Branislav Kveton, Georgios Theocharous