Search Sciweavers | Sciweavers

377 search results - page 1 / 76

» Convergence of Stochastic Iterative Dynamic Programming Algo...

129

Voted

NIPS
1993

128views Information Technology» more NIPS 1993»

Convergence of Stochastic Iterative Dynamic Programming Algorithms

15 years 4 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms,includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...

claim paper

Read More »

125

Voted

CDC
2010
IEEE

139views Control Systems» more CDC 2010»

Q-learning and enhanced policy iteration in discounted dynamic programming

14 years 9 months ago

Download web.mit.edu

We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...

Dimitri P. Bertsekas, Huizhen Yu

claim paper

Read More »

114

Voted

ORL
2008

115views more ORL 2008»

On the convergence of stochastic dual dynamic programming and related methods

15 years 2 months ago

Download edoc.hu-berlin.de

We discuss the almost-sure convergence of a broad class of sampling algorithms for multi-stage stochastic linear programs. We provide a convergence proof based on the finiteness o...

Andrew B. Philpott, Z. Guan

claim paper

Read More »

112

click to vote

IOR
2010

98views more IOR 2010»

A Shadow Simplex Method for Infinite Linear Programs

15 years 3 days ago

Download www-personal.umich.edu

We present a Simplex-type algorithm, that is, an algorithm that moves from one extreme point of the infinite-dimensional feasible region to another not necessarily adjacent extrem...

Archis Ghate, Dushyant Sharma, Robert L. Smith

claim paper

Read More »

119

click to vote

CCE
2004

108views Software Engineering» more CCE 2004»

Improving convergence of the stochastic decomposition algorithm by using an efficient sampling technique

15 years 2 months ago

Download www.vri-custom.org

This work focuses on the basic stochastic decomposition (SD) algorithm of Higle and Sen [J.L. Higle, S. Sen, Stochastic Decomposition, Kluwer Academic Publishers, 1996] for two-st...

José María Ponce-Ortega, Vicente Ric...

claim paper

Read More »

« Prev « First page 1 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers