Sciweavers

499 search results - page 30 / 100
» Model Minimization in Markov Decision Processes
Sort
View
ICML
2004
IEEE
16 years 18 days ago
Learning low dimensional predictive representations
Predictive state representations (PSRs) have recently been proposed as an alternative to partially observable Markov decision processes (POMDPs) for representing the state of a dy...
Matthew Rosencrantz, Geoffrey J. Gordon, Sebastian...
ICIP
2010
IEEE
14 years 9 months ago
3D augmented Markov random field for object recognition
In this paper, we propose to use 3D information to augment the Markov random field (MRF) model for object recognition. Conventional MRF for image-based object recognition usually ...
Wei Yu, Ahmed Bilal Ashraf, Yao-Jen Chang, Congcon...
CLIMA
2011
13 years 11 months ago
Verifying Team Formation Protocols with Probabilistic Model Checking
Multi-agent systems are an increasingly important software paradigm and in many of its applications agents cooperate to achieve a particular goal. This requires the design of effi...
Taolue Chen, Marta Z. Kwiatkowska, David Parker, A...
IJCAI
2001
15 years 1 months ago
Adaptive Control of Acyclic Progressive Processing Task Structures
The progressive processing model allows a system to trade off resource consumption against the quality of the outcome by mapping each activity to a graph of potential solution met...
Stéphane Cardon, Abdel-Illah Mouaddib, Shlo...
ATAL
2008
Springer
15 years 1 months ago
Expediting RL by using graphical structures
The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...
Peng Dai, Alexander L. Strehl, Judy Goldsmith