Sciweavers

995 search results - page 68 / 199
» nips 2007
Sort
View
NIPS
2007
15 years 2 months ago
Online Linear Regression and Its Application to Model-Based Reinforcement Learning
We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...
Alexander L. Strehl, Michael L. Littman
96
Voted
NIPS
2007
15 years 2 months ago
Agreement-Based Learning
The learning of probabilistic models with many hidden variables and nondecomposable dependencies is an important and challenging problem. In contrast to traditional approaches bas...
Percy Liang, Dan Klein, Michael I. Jordan
104
Voted
NIPS
2007
15 years 2 months ago
Discriminative Log-Linear Grammars with Latent Variables
We demonstrate that log-linear grammars with latent variables can be practically trained using discriminative methods. Central to efficient discriminative training is a hierarchi...
Slav Petrov, Dan Klein
101
Voted
NIPS
2007
15 years 2 months ago
Using Deep Belief Nets to Learn Covariance Kernels for Gaussian Processes
We show how to use unlabeled data and a deep belief net (DBN) to learn a good covariance kernel for a Gaussian process. We first learn a deep generative model of the unlabeled da...
Ruslan Salakhutdinov, Geoffrey E. Hinton
84
Voted
NIPS
2007
15 years 2 months ago
Random Sampling of States in Dynamic Programming
We combine three threads of research on approximate dynamic programming: sparse random sampling of states, value function and policy approximation using local models, and using lo...
Christopher G. Atkeson, Benjamin Stephens