Search Sciweavers | Sciweavers

166 search results - page 16 / 34

» Online model learning in adversarial Markov decision process...

UAI 2000

PEGASUS: A policy search method for large MDPs and POMDPs

15 years 21 days ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

NAACL 2007

Comparing User Simulation Models For Dialog Strategy Learning

15 years 23 days ago

Download www.cs.pitt.edu

This paper explores what kind of user simulation model is suitable for developing a training corpus for using Markov Decision Processes (MDPs) to automatically learn dialog strate...

Hua Ai, Joel R. Tetreault, Diane J. Litman

ICIP 2002, IEEE

Learning a decision boundary for face detection

16 years 29 days ago

Download svr-www.eng.cam.ac.uk

This paper describes a pattern classification approach for detecting frontal-view faces via learning a decision boundary. The classification can be achieved either by explicit est...

Tae-Kyun Kim, Donggeon Kong, Sang Ryong Kim

JMLR 2006

Spam Filtering Using Statistical Data Compression Models

14 years 11 months ago

Download jmlr.csail.mit.edu

Spam filtering poses a special problem in text categorization, of which the defining characteristic is that filters face an active adversary, which constantly attempts to evade fi...

Andrej Bratko, Gordon V. Cormack, Bogdan Filipic, ...

NIPS 1996

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 20 days ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies
