Search Sciweavers | Sciweavers

5 search results - page 1 / 1

ORL
2008

On polynomial cases of the unichain classification problem for Markov Decision Processes

13 years 4 months ago

Download www.ams.sunysb.edu

The unichain classification problem detects whether a finite state and action MDP is unichain under all deterministic policies. This problem is NP-hard [11]. This paper provides p...

Eugene A. Feinberg, Fenghsu Yang

ICONIP
2009

Quasi-Deterministic Partially Observable Markov Decision Processes

13 years 2 months ago

Download damas.ift.ulaval.ca

We study a subclass of POMDPs, called quasi-deterministic POMDPs (QDET-POMDPs), characterized by deterministic actions and stochastic observations. While this framework does not mo...

Camille Besse, Brahim Chaib-draa

ATAL
2010
Springer

Quasi deterministic POMDPs and DecPOMDPs

13 years 6 months ago

Download www.damas.ift.ulaval.ca

In this paper, we study a particular subclass of partially observable models, called quasi-deterministic partially observable Markov decision processes (QDET-POMDPs), characterize...

Camille Besse, Brahim Chaib-draa

SIGECOM
2009
ACM

Policy teaching through reward function learning

13 years 11 months ago

Download www.eecs.harvard.edu

Policy teaching considers a Markov Decision Process setting in which an interested party aims to influence an agent’s decisions by providing limited incentives. In this paper, ...

Haoqi Zhang, David C. Parkes, Yiling Chen

ICML
2009
IEEE

Predictive representations for policy gradient in POMDPs

14 years 5 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa
