Search Sciweavers | Sciweavers

2005 search results - page 153 / 401

» Decisive Markov Chains

107

click to vote

UAI
2000

133views Artificial Intelligence» more UAI 2000»

PEGASUS: A policy search method for large MDPs and POMDPs

15 years 3 months ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

claim paper

Read More »

128

click to vote

ENTCS
2006

134views more ENTCS 2006»

Partial Order Reduction for Probabilistic Branching Time

15 years 1 months ago

Download www.win.tue.nl

In the past, partial order reduction has been used successfully to combat the state explosion problem in the context of model checking for non-probabilistic systems. For both line...

Christel Baier, Pedro R. D'Argenio, Marcus Grö...

claim paper

Read More »

123

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 7 days ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

132

Voted

ECCV
2004
Springer

248views Computer Vision» more ECCV 2004»

An MCMC-Based Particle Filter for Tracking Multiple Interacting Targets

16 years 3 months ago

Download www.cc.gatech.edu

Abstract. We describe a Markov chain Monte Carlo based particle filter that effectively deals with interacting targets, i.e., targets that are influenced by the proximity and/or be...

Zia Khan, Tucker R. Balch, Frank Dellaert

claim paper

Read More »

135

click to vote

STOC
1989
ACM

99views Algorithms» more STOC 1989»

A Random Polynomial Time Algorithm for Approximating the Volume of Convex Bodies

15 years 5 months ago

Download www.math.cmu.edu

We present a randomised polynomial time algorithm for approximating the volume of a convex body K in n-dimensional Euclidean space. The proof of correctness of the algorithm relie...

Martin E. Dyer, Alan M. Frieze, Ravi Kannan

claim paper

Read More »

« Prev « First page 153 / 401 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers