Apprenticeship learning using linear programming

In apprenticeship learning, the goal is to learn a policy in a Markov decision process that is at least as good as a policy demonstrated by an expert. The difficulty arises because the MDP's true reward function is assumed to be unknown. We show how to frame apprenticeship learning as a linear programming problem, and show that using an off-the-shelf LP solver to solve this problem results in a substantial improvement in running time over existing methods -- up to two orders of magnitude faster in our experiments. Additionally, our approach produces stationary policies, while all existing methods for apprenticeship learning output policies that are "mixed", i.e., randomized combinations of stationary policies. The technique used is general enough to convert any mixed policy to a stationary policy.
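The abstract describes posing apprenticeship learning as a single linear program over state-action occupancy measures, handing it to an off-the-shelf LP solver, and reading a stationary policy directly off the solution. Below is a minimal sketch of that idea for a small tabular MDP; the names (P, phi, mu_E, alpha, gamma), the use of scipy.optimize.linprog, and the exact constraint layout are illustrative assumptions, not the authors' reference implementation.

```python
# A minimal sketch, assuming a small tabular MDP: pose apprenticeship learning
# as one LP over occupancy measures and solve it with an off-the-shelf solver.
# Names and constraint layout are assumptions made for illustration.
import numpy as np
from scipy.optimize import linprog


def lp_apprenticeship(P, phi, mu_E, alpha, gamma):
    """P: (S, A, S) transition probabilities; phi: (S, A, K) reward features;
    mu_E: (K,) expert feature expectations; alpha: (S,) start distribution."""
    S, A, K = phi.shape
    n = S * A  # one occupancy variable x(s, a) per state-action pair

    # Decision vector [x(0,0), ..., x(S-1,A-1), B]; maximize the margin B,
    # i.e. minimize -B, since linprog is a minimizer.
    c = np.zeros(n + 1)
    c[-1] = -1.0

    # Margin constraints, one per feature k:
    #   sum_{s,a} x(s,a) * phi_k(s,a) - mu_E[k] >= B
    # rewritten as  -phi_k . x + B <= -mu_E[k].
    A_ub = np.hstack([-phi.reshape(n, K).T, np.ones((K, 1))])
    b_ub = -mu_E

    # Bellman flow constraints, one per state s:
    #   sum_a x(s,a) - gamma * sum_{s',a'} P(s | s', a') x(s',a') = alpha(s).
    A_eq = np.zeros((S, n + 1))
    for s in range(S):
        A_eq[s, s * A:(s + 1) * A] += 1.0
        A_eq[s, :n] -= gamma * P[:, :, s].reshape(n)
    b_eq = alpha

    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq,
                  bounds=[(0, None)] * n + [(None, None)])
    x = res.x[:n].reshape(S, A)

    # The occupancy measure x induces a stationary policy directly:
    # pi(a | s) = x(s, a) / sum_a' x(s, a'); unvisited states get uniform.
    occ = x.sum(axis=1, keepdims=True)
    pi = np.where(occ > 0, x / np.maximum(occ, 1e-12), 1.0 / A)
    return pi, res.x[-1]
```

Because the policy is recovered by normalizing the occupancy measure over actions, the output is stationary by construction, which is the property the abstract contrasts with the mixed (randomized combinations of stationary) policies produced by prior methods.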
Added: 17 Nov 2009
Updated: 17 Nov 2009
Type: Conference
Year: 2008
Where: ICML
Authors: Umar Syed, Michael H. Bowling, Robert E. Schapire