Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

14

ICML
2009
IEEE

favoriteEmaildiscussreport

123views Machine Learning» more ICML 2009»

Constraint relaxation in approximate linear programs

14 years 5 months ago

Constraint relaxation in approximate linear programs

Download anytime.cs.umass.edu

Approximate Linear Programming (ALP) is a reinforcement learning technique with nice theoretical properties, but it often performs poorly in practice. We identify some reasons for the poor quality of ALP solutions in problems where the approximation induces virtual loops. We then introduce two methods for improving solution quality. One method rolls out selected constraints of the ALP, guided by the dual information. The second method is a relaxation of the ALP, based on external penalty methods. The latter method is applicable in domains in which rolling out constraints is impractical. Both approaches show promising empirical results for simple benchmark problems as well as for a realistic blood inventory management problem.

Marek Petrik, Shlomo Zilberstein

Real-time Traffic

ALP Solutions | External Penalty Methods | ICML 2009 | Machine Learning | Simple Benchmark Problems |

claim paper

Related Content

» On the knapsack closure of 01 Integer Linear Programs

» How tight is the corner relaxation

» Linear approximations for rate control in video coding

» Linear Programming Relaxations and Belief Propagation An Empirical Study

» On linear and semidefinite programming relaxations for hypergraph matching

» On a linear programming approach to the discrete Willmore boundary value problem and gener...

» Boosting Classifiers with Tightened L0Relaxation Penalties

» A Constraint Generation Approach to Learning Stable Linear Dynamical Systems

» Quadratic programming relaxations for metric labeling and Markov random field MAP estimati...

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2009
Where	ICML
Authors	Marek Petrik, Shlomo Zilberstein

Comments (0)