Search Sciweavers | Sciweavers

499 search results - page 35 / 100

» Model Minimization in Markov Decision Processes

106

click to vote

ATAL
2007
Springer

94views Intelligent Agents» more ATAL 2007»

Graphical models for online solutions to interactive POMDPs

15 years 8 months ago

Download www.cs.uga.edu

We develop a new graphical representation for interactive partially observable Markov decision processes (I-POMDPs) that is significantly more transparent and semantically clear t...

Prashant Doshi, Yifeng Zeng, Qiongyu Chen

claim paper

Read More »

click to vote

ATAL
2007
Springer

112views Intelligent Agents» more ATAL 2007»

A globally optimal algorithm for TTD-MDPs

15 years 8 months ago

Download www.cc.gatech.edu

In this paper, we discuss the use of Targeted Trajectory Distribution Markov Decision Processes (TTD-MDPs)—a variant of MDPs in which the goal is to realize a speciﬁed distrib...

Sooraj Bhat, David L. Roberts, Mark J. Nelson, Cha...

claim paper

Read More »

125

click to vote

ATAL
2010
Springer

157views Intelligent Agents» more ATAL 2010»

Augmenting appearance-based localization and navigation using belief update

15 years 3 months ago

Download www.aamas-conference.org

Appearance-based localization compares the current image taken from a robot's camera to a set of pre-recorded images in order to estimate the current location of the robot. S...

George Chrysanthakopoulos, Guy Shani

claim paper

Read More »

109

Voted

COMPLEX
2009
Springer

109views Theoretical Computer Science» more COMPLEX 2009»

Non-sufficient Memories That Are Sufficient for Prediction

15 years 5 months ago

Download personal-homepages.mis.mpg.de

The causal states of computational mechanics define the minimal sufficient (prescient) memory for a given stationary stochastic process. They induce the -machine which is a hidden...

Wolfgang Löhr, Nihat Ay

claim paper

Read More »

111

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

16 years 2 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

« Prev « First page 35 / 100 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers