Search Sciweavers | Sciweavers

508 search results - page 21 / 102

» Learning for stochastic dynamic programming

DAGSTUHL
2007

Software Engineering

Learning Probabilistic Relational Dynamics for Multiple Tasks

15 years 3 months ago

Download people.csail.mit.edu

The ways in which an agent’s actions affect the world can often be modeled compactly using a set of relational probabilistic planning rules. This paper addresses the problem of ...

Ashwin Deshpande, Brian Milch, Luke S. Zettlemoyer...

IPCO
2004

Optimization

A Robust Optimization Approach to Supply Chain Management

15 years 3 months ago

Download www.cs.brown.edu

We propose a general methodology based on robust optimization to address the problem of optimally controlling a supply chain subject to stochastic demand in discrete time...

Dimitris Bertsimas, Aurélie Thiele

CDC
2010
IEEE

Control Systems

Q-learning and enhanced policy iteration in discounted dynamic programming

14 years 9 months ago

Download web.mit.edu

We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...

Dimitri P. Bertsekas, Huizhen Yu

CVPR
2009
IEEE

Computer Vision

Variational Layered Dynamic Textures

16 years 9 months ago

Download www.vision.jhu.edu

A dynamic texture is a generative model for video that treats the video as a sample from a spatio-temporal stochastic process. One problem associated with the dynamic texture is t...

Antoni B. Chan, Nuno Vasconcelos

ML
1998
ACM

Machine Learning

Elevator Group Control Using Multiple Reinforcement Learning Agents

15 years 1 month ago

Download www.clear.rice.edu

Recent algorithmic and theoretical advances in reinforcement learning (RL) have attracted widespread interest. RL algorithms have appeared that approximate dynamic programming on an ...

Robert H. Crites, Andrew G. Barto
