Sciweavers

1277 search results - page 139 / 256
» Terminating Decision Algorithms Optimally
Sort
View
96
Voted
ECML
2007
Springer
15 years 9 months ago
Safe Q-Learning on Complete History Spaces
In this article, we present an idea for solving deterministic partially observable markov decision processes (POMDPs) based on a history space containing sequences of past observat...
Stephan Timmer, Martin Riedmiller
129
Voted
GECCO
2004
Springer
142views Optimization» more  GECCO 2004»
15 years 9 months ago
Improving MACS Thanks to a Comparison with 2TBNs
Abstract. Factored Markov Decision Processes is the theoretical framework underlying multi-step Learning Classifier Systems research. This framework is mostly used in the context ...
Olivier Sigaud, Thierry Gourdin, Pierre-Henri Wuil...
149
Voted
PR
2006
84views more  PR 2006»
15 years 3 months ago
Document zone content classification and its performance evaluation
This paper describes an algorithm for the determination of zone content type of a given zone within a document image. We take a statistical based approach and represent each zone ...
Yalin Wang, Ihsin T. Phillips, Robert M. Haralick
AAAI
2006
15 years 5 months ago
Targeting Specific Distributions of Trajectories in MDPs
We define TTD-MDPs, a novel class of Markov decision processes where the traditional goal of an agent is changed from finding an optimal trajectory through a state space to realiz...
David L. Roberts, Mark J. Nelson, Charles Lee Isbe...
ECCV
2004
Springer
16 years 5 months ago
An Information-Based Measure for Grouping Quality
We propose a method for measuring the quality of a grouping result, based on the following observation: a better grouping result provides more information about the true, unknown g...
Erik A. Engbers, Michael Lindenbaum, Arnold W. M. ...