Search Sciweavers | Sciweavers

682 search results - page 66 / 137

» One-Counter Markov Decision Processes

101

click to vote

ATAL
2008
Springer

103views Intelligent Agents» more ATAL 2008»

The permutable POMDP: fast solutions to POMDPs for preference elicitation

15 years 3 months ago

Download mapleleaf.csail.mit.edu

The ability for an agent to reason under uncertainty is crucial for many planning applications, since an agent rarely has access to complete, error-free information about its envi...

Finale Doshi, Nicholas Roy

claim paper

Read More »

135

click to vote

PRICAI
2000
Springer

193views Artificial Intelligence» more PRICAI 2000»

Generating Hierarchical Structure in Reinforcement Learning from State Variables

15 years 4 months ago

Download www.csee.umbc.edu

This paper presents the CQ algorithm which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The ...

Bernhard Hengst

claim paper

Read More »

116

click to vote

ATAL
2010
Springer

136views Intelligent Agents» more ATAL 2010»

Quasi deterministic POMDPs and DecPOMDPs

15 years 2 months ago

Download www.damas.ift.ulaval.ca

In this paper, we study a particular subclass of partially observable models, called quasi-deterministic partially observable Markov decision processes (QDET-POMDPs), characterize...

Camille Besse, Brahim Chaib-draa

claim paper

Read More »

140

click to vote

ECML
2007
Springer

170views Machine Learning» more ECML 2007»

Sequence Labeling with Reinforcement Learning and Ranking Algorithms

15 years 3 months ago

Download nieme.lip6.fr

Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatic involve the generic task of sequence labeling. In many cases, the aim is to assi...

Francis Maes, Ludovic Denoyer, Patrick Gallinari

claim paper

Read More »

135

Voted

ICML
1995
IEEE

213views Machine Learning» more ICML 1995»

Learning Policies for Partially Observable Environments: Scaling Up

16 years 1 months ago

Download reference.kfupm.edu.sa

Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...

Michael L. Littman, Anthony R. Cassandra, Leslie P...

claim paper

Read More »

« Prev « First page 66 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers