Sciweavers

ICML
2005
IEEE
Dynamic preferences in multi-criteria reinforcement learning
The current framework of reinforcement learning is based on maximizing the expected returns based on scalar rewards. But in many real world situations, tradeoffs must be made amon...
Sriraam Natarajan, Prasad Tadepalli
ICML
2005
IEEE
Learning discontinuities with products-of-sigmoids for switching between local models
Sensorimotor data from many interesting physical interactions comprises discontinuities. While existing locally weighted learning approaches aim at learning smooth functions, we p...
Marc Toussaint, Sethu Vijayakumar
ICML
2005
IEEE
Learn to weight terms in information retrieval using category information
How to assign appropriate weights to terms is one of the critical issues in information retrieval. Many term weighting schemes are unsupervised. They are either based on the empir...
Rong Jin, Joyce Y. Chai, Luo Si
ICML
2005
IEEE
Interactive learning of mappings from visual percepts to actions
We introduce flexible algorithms that can automatically learn mappings from images to actions by interacting with their environment. They work by introducing an image classifier i...
Justus H. Piater, Sébastien Jodogne
ICML
2005
IEEE
Relating reinforcement learning performance to classification performance
We prove a quantitative connection between the expected sum of rewards of a policy and binary classification performance on created subproblems. This connection holds without any ...
John Langford, Bianca Zadrozny