Sciweavers


ICML
2005
IEEE


Relating reinforcement learning performance to classification performance


We prove a quantitative connection between the expected sum of rewards of a policy and binary classification performance on created subproblems. This connection holds without any unobservable assumptions (no assumption of independence, small mixing time, fully observable states, or even hidden states) and the resulting statement is independent of the number of states or actions. The statement is critically dependent on the size of the rewards and prediction performance of the created classifiers. We also provide some general guidelines for obtaining good classification performance on the created subproblems. In particular, we discuss possible methods for generating training examples for a classifier learning algorithm.
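As a rough picture of the objects the abstract refers to, the sketch below assembles a policy from one binary classifier per time step and estimates its expected sum of rewards by simulation. Everything in it is an assumption made for illustration and is not the paper's construction: the toy environment (a random bit is observed, and reward 1 is given when the action matches it), the noise model for the classifiers, and the hypothetical names `make_noisy_classifier`, `rollout`, and `estimate_return`.

```python
import random


def make_noisy_classifier(error_rate, rng):
    """Return a binary classifier that picks the wrong label with probability error_rate.

    Assumption: in this toy setting the correct label equals the observed bit.
    """
    def classify(observation):
        if rng.random() < error_rate:
            return 1 - observation  # misclassification
        return observation
    return classify


def rollout(classifiers, rng):
    """Run one episode, using one classifier per time step to choose the action."""
    total_reward = 0.0
    for classify in classifiers:
        observation = rng.randint(0, 1)  # toy observation: a fair random bit
        action = classify(observation)
        total_reward += 1.0 if action == observation else 0.0
    return total_reward


def estimate_return(error_rate, horizon=5, episodes=20000, seed=0):
    """Monte Carlo estimate of the expected sum of rewards of the classifier-based policy."""
    rng = random.Random(seed)
    classifiers = [make_noisy_classifier(error_rate, rng) for _ in range(horizon)]
    returns = [rollout(classifiers, rng) for _ in range(episodes)]
    return sum(returns) / episodes


if __name__ == "__main__":
    for eps in (0.0, 0.1, 0.2, 0.5):
        print(f"classifier error rate {eps:.1f}: estimated return {estimate_return(eps):.3f}")
```

In this toy setting the expected return is simply the horizon times one minus the error rate, so it only illustrates the qualitative point that the policy's reward depends on the size of the rewards and on how well the classifiers predict. It does not reproduce the bound proved in the paper.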

John Langford, Bianca Zadrozny


Binary Classification Performance | Classifier Learning Algorithm | ICML 2005 | Machine Learning | Unobservable Assumptions


Related Content

» Gradient-Based Learning Updates Improve XCS Performance in Multistep Problems

» Reinforcement Learning of Listener Response for Mood Classification of Audio

» Predicting relative performance of classifiers from samples

» Knowledge transfer via advice taking

» Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

» Transfer Learning in Reinforcement Learning Problems Through Partial Policy Recycling

» Guiding Inference Through Relational Reinforcement Learning

» Incremental Possibilistic Approach for Online Clustering and Classification

» Incremental Learning of Relational Action Rules

Post Info

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2005
Where	ICML
Authors	John Langford, Bianca Zadrozny
