optimal value function

CORR
2012
Springer

An Incremental Sampling-based Algorithm for Stochastic Optimal Control

13 years 11 months ago

In this paper, we consider a class of continuous-time, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation ...

Vu Anh Huynh, Sertac Karaman, Emilio Frazzoli

AIPS
2011

Heuristic Search for Generalized Stochastic Shortest Path MDPs

14 years 7 months ago

Download www.cs.washington.edu

Research in efficient methods for solving infinite-horizon MDPs has so far concentrated primarily on discounted MDPs and the more general stochastic shortest path problems (SSPs...

Andrey Kolobov, Mausam, Daniel S. Weld, Hector Gef...

MP
2002

A note on sensitivity of value functions of mathematical programs with complementarity constraints

15 years 3 months ago

Download www.eng.cam.ac.uk

Using standard nonlinear programming (NLP) theory, we establish formulas for first and second order directional derivatives for optimal value functions of parametric mathematical ...

Xinmin Hu, Daniel Ralph

JGO
2008

Smoothing by mollifiers. Part I: semi-infinite optimization

15 years 3 months ago

Download kop.ior.kit.edu

We show that a compact feasible set of a standard semi-infinite optimization problem can be approximated arbitrarily well by a level set of a single smooth function with certain r...

Hubertus Th. Jongen, Oliver Stein

AAAI
2006

Compact, Convex Upper Bound Iteration for Approximate POMDP Planning

15 years 4 months ago

Download www.aaai.org

Partially observable Markov decision processes (POMDPs) are an intuitive and general way to model sequential decision making problems under uncertainty. Unfortunately, even approx...

Tao Wang, Pascal Poupart, Michael H. Bowling, Dale...

WSC
2007

Path-wise estimators and cross-path regressions: an application to evaluating portfolio strategies

15 years 5 months ago

Download www.informs-sim.org

Recently developed dual techniques allow us to evaluate a given sub-optimal dynamic portfolio policy by using the policy to construct an upper bound on the optimal value function....

Martin B. Haugh, Ashish Jain

FLAIRS
2008

A Novel Prioritization Technique for Solving Markov Decision Processes

15 years 5 months ago

Download damas.ift.ulaval.ca

We address the problem of computing an optimal value function for Markov decision processes. Since finding this function quickly and accurately requires substantial computation ef...

Jilles Steeve Dibangoye, Brahim Chaib-draa, Abdel-...

PRICAI
2000
Springer

A POMDP Approximation Algorithm That Anticipates the Need to Observe

15 years 7 months ago

Download eecs.oregonstate.edu

This paper introduces the even-odd POMDP, an approximation to POMDPs in which the world is assumed to be fully observable every other time step. The even-odd POMDP can be converte...

Valentina Bayer Zubek, Thomas G. Dietterich

ICML
2001
IEEE

Symmetry in Markov Decision Processes and its Implications for Single Agent and Multiagent Learning

16 years 4 months ago

Download www-2.cs.cmu.edu

This paper examines the notion of symmetry in Markov decision processes (MDPs). We define symmetry for an MDP and show how it can be exploited for more effective learning in singl...

Martin Zinkevich, Tucker R. Balch

ICML
2005
IEEE

Finite time bounds for sampling based fitted value iteration

16 years 4 months ago

Download www.machinelearning.org

In this paper we consider sampling based fitted value iteration for discounted, large (possibly infinite) state space, finite action Markovian Decision Problems where only a gener...

Csaba Szepesvári, Rémi Munos
