Sciweavers

508 search results - page 21 / 102
» Learning for stochastic dynamic programming
Sort
View
DAGSTUHL
2007
15 years 3 months ago
Learning Probabilistic Relational Dynamics for Multiple Tasks
The ways in which an agent’s actions affect the world can often be modeled compactly using a set of relational probabilistic planning rules. This paper addresses the problem of ...
Ashwin Deshpande, Brian Milch, Luke S. Zettlemoyer...
IPCO
2004
107views Optimization» more  IPCO 2004»
15 years 3 months ago
A Robust Optimization Approach to Supply Chain Management
Abstract. We propose a general methodology based on robust optimization to address the problem of optimally controlling a supply chain subject to stochastic demand in discrete time...
Dimitris Bertsimas, Aurélie Thiele
CDC
2010
IEEE
139views Control Systems» more  CDC 2010»
14 years 9 months ago
Q-learning and enhanced policy iteration in discounted dynamic programming
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...
Dimitri P. Bertsekas, Huizhen Yu
CVPR
2009
IEEE
16 years 9 months ago
Variational Layered Dynamic Textures
A dynamic texture is a generative model for video that treats the video as a sample from spatio-temporal stochastic process. One problem associated with the dynamic texture is t...
Antoni B. Chan, Nuno Vasconcelos
ML
1998
ACM
101views Machine Learning» more  ML 1998»
15 years 1 months ago
Elevator Group Control Using Multiple Reinforcement Learning Agents
Recent algorithmic and theoretical advances in reinforcement learning (RL) have attracted widespread interest. RL algorithmshave appeared that approximatedynamic programming on an ...
Robert H. Crites, Andrew G. Barto