Sciweavers

166 search results - page 16 / 34
» Online model learning in adversarial Markov decision process...
Sort
View
UAI
2000
14 years 11 months ago
PEGASUS: A policy search method for large MDPs and POMDPs
We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...
Andrew Y. Ng, Michael I. Jordan
67
Voted
NAACL
2007
14 years 11 months ago
Comparing User Simulation Models For Dialog Strategy Learning
This paper explores what kind of user simulation model is suitable for developing a training corpus for using Markov Decision Processes (MDPs) to automatically learn dialog strate...
Hua Ai, Joel R. Tetreault, Diane J. Litman
85
Voted
ICIP
2002
IEEE
15 years 11 months ago
Learning a decision boundary for face detection
This paper describes a pattern classification approach for detecting frontal-view faces via learning a decision boundary. The classification can be achieved either by explicit est...
Tae-Kyun Kim, Donggeon Kong, Sang Ryong Kim
JMLR
2006
125views more  JMLR 2006»
14 years 9 months ago
Spam Filtering Using Statistical Data Compression Models
Spam filtering poses a special problem in text categorization, of which the defining characteristic is that filters face an active adversary, which constantly attempts to evade fi...
Andrej Bratko, Gordon V. Cormack, Bogdan Filipic, ...
NIPS
1996
14 years 11 months ago
Multidimensional Triangulation and Interpolation for Reinforcement Learning
Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...
Scott Davies