Sciweavers

656 search results - page 23 / 132
» Complexity of finite-horizon Markov decision process problem...
CORR
2008
Springer
15 years 1 month ago
Quickest Change Detection of a Markov Process Across a Sensor Array
Recent attention in quickest change detection in the multi-sensor setting has been on the case where the densities of the observations change at the same instant at all the sensor...
Vasanthan Raghavan, Venugopal V. Veeravalli
FOCS
2007
IEEE
15 years 8 months ago
On the Complexity of Nash Equilibria and Other Fixed Points (Extended Abstract)
Kousha Etessami (LFCS, School of Informatics, University of Edinburgh) and Mihalis Yannakakis (Department of Computer Science, Columbia University). We reexamine what it means to...
Kousha Etessami, Mihalis Yannakakis
IJCAI
2001
15 years 3 months ago
An Improved Grid-Based Approximation Algorithm for POMDPs
Although a partially observable Markov decision process (POMDP) provides an appealing model for problems of planning under uncertainty, exact algorithms for POMDPs are intractable...
Rong Zhou, Eric A. Hansen
IWQOS
2011
Springer
14 years 4 months ago
An MDP-based admission control for a QoS-aware service-oriented system
In this paper, we address the problem of providing a service broker, which offers to prospective users a composite service with a range of different Quality of Service (QoS) class...
Marco Abundo, Valeria Cardellini, Francesco Lo Pre...
ALT
2006
Springer
15 years 10 months ago
Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence
We address the problem of reinforcement learning in which observations may exhibit an arbitrary form of stochastic dependence on past observations and actions. The task for an age...
Daniil Ryabko, Marcus Hutter