Search Sciweavers | Sciweavers

94

Voted

ICML
2005
IEEE

119views Machine Learning» more ICML 2005»

Dynamic preferences in multi-criteria reinforcement learning

16 years 1 months ago

The current framework of reinforcement learning is based on maximizing the expected returns based on scalar rewards. But in many real world situations, tradeoffs must be made amon...

Sriraam Natarajan, Prasad Tadepalli

claim paper

Read More »

91

Voted

ICML
2005
IEEE

113views Machine Learning» more ICML 2005»

Learning discontinuities with products-of-sigmoids for switching between local models

16 years 1 months ago

Download homepages.inf.ed.ac.uk

Sensorimotor data from many interesting physical interactions comprises discontinuities. While existing locally weighted learning approaches aim at learning smooth functions, we p...

Marc Toussaint, Sethu Vijayakumar

claim paper

Read More »

109

click to vote

ICML
2005
IEEE

133views Machine Learning» more ICML 2005»

Learn to weight terms in information retrieval using category information

16 years 1 months ago

Download www.cse.msu.edu

How to assign appropriate weights to terms is one of the critical issues in information retrieval. Many term weighting schemes are unsupervised. They are either based on the empir...

Rong Jin, Joyce Y. Chai, Luo Si

claim paper

Read More »

100

click to vote

ICML
2005
IEEE

201views Machine Learning» more ICML 2005»

Interactive learning of mappings from visual percepts to actions

16 years 1 months ago

Download www.machinelearning.org

We introduce flexible algorithms that can automatically learn mappings from images to actions by interacting with their environment. They work by introducing an image classifier i...

Justus H. Piater, Sébastien Jodogne

claim paper

Read More »

110

Voted

ICML
2005
IEEE

93views Machine Learning» more ICML 2005»

Relating reinforcement learning performance to classification performance

16 years 1 months ago

Download hunch.net

We prove a quantitative connection between the expected sum of rewards of a policy and binary classification performance on created subproblems. This connection holds without any ...

John Langford, Bianca Zadrozny

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers