Search Sciweavers | Sciweavers

82

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 22 days ago

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

85

click to vote

ICML
2008
IEEE

110views Machine Learning» more ICML 2008»

Non-parametric policy gradients: a unified treatment of propositional and relational domains

16 years 22 days ago

Download www-kd.iai.uni-bonn.de

Policy gradient approaches are a powerful instrument for learning how to interact with the environment. Existing approaches have focused on propositional and continuous domains on...

Kristian Kersting, Kurt Driessens

claim paper

Read More »

82

click to vote

ICML
2008
IEEE

229views Machine Learning» more ICML 2008»

Classification using discriminative restricted Boltzmann machines

16 years 22 days ago

Download www.cs.toronto.edu

Recently, many applications for Restricted Boltzmann Machines (RBMs) have been developed for a large variety of learning problems. However, RBMs are usually used as feature extrac...

Hugo Larochelle, Yoshua Bengio

claim paper

Read More »

80

click to vote

ICML
2008
IEEE

105views Machine Learning» more ICML 2008»

Learning all optimal policies with multiple criteria

16 years 22 days ago

Download leon.barrettnexus.com

We describe an algorithm for learning in the presence of multiple criteria. Our technique generalizes previous approaches in that it can learn optimal policies for all linear pref...

Leon Barrett, Srini Narayanan

claim paper

Read More »

108

click to vote

ICML
2008
IEEE

154views Machine Learning» more ICML 2008»

Beam sampling for the infinite hidden Markov model

16 years 22 days ago

Download mlg.eng.cam.ac.uk

The infinite hidden Markov model is a nonparametric extension of the widely used hidden Markov model. Our paper introduces a new inference algorithm for the infinite Hidden Markov...

Jurgen Van Gael, Yunus Saatci, Yee Whye Teh, Zoubi...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers