Search Sciweavers | Sciweavers

682 search results - page 40 / 137

» One-Counter Markov Decision Processes

214

click to vote

Publication

151views

Robust Bayesian reinforcement learning through tight lower bounds

13 years 10 months ago

Download arxiv.org

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinfo...

Christos Dimitrakakis

posted by olethros

Read More »

click to vote

ICML
2004
IEEE

120views Machine Learning» more ICML 2004»

Utile distinction hidden Markov models

16 years 20 days ago

Download www.idsia.ch

This paper addresses the problem of constructing good action selection policies for agents acting in partially observable environments, a class of problems generally known as Part...

Daan Wierstra, Marco Wiering

claim paper

Read More »

108

click to vote

SIGMETRICS
2000
ACM

105views Hardware» more SIGMETRICS 2000»

Using the exact state space of a Markov model to compute approximate stationary measures

15 years 4 months ago

Download www.cs.ucr.edu

We present a new approximation algorithm based on an exact representation of the state space S, using decision diagrams, and of the transition rate matrix R, using Kronecker algeb...

Andrew S. Miner, Gianfranco Ciardo, Susanna Donate...

claim paper

Read More »

click to vote

UAI
2000

133views Artificial Intelligence» more UAI 2000»

PEGASUS: A policy search method for large MDPs and POMDPs

15 years 1 months ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

claim paper

Read More »

click to vote

IJFCS
2008

130views more IJFCS 2008»

Equivalence of Labeled Markov Chains

14 years 12 months ago

Download mtc.epfl.ch

We consider the equivalence problem for labeled Markov chains (LMCs), where each state is labeled with an observation. Two LMCs are equivalent if every finite sequence of observat...

Laurent Doyen, Thomas A. Henzinger, Jean-Fran&cced...

claim paper

Read More »

« Prev « First page 40 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers