Sciweavers

126 search results - page 5 / 26
» nips 2001
Sort
View
NIPS
2001
14 years 11 months ago
A Variational Approach to Learning Curves
We combine the replica approach from statistical physics with a variational approach to analyze learning curves analytically. We apply the method to Gaussian process regression. A...
Dörthe Malzahn, Manfred Opper
NIPS
2001
14 years 11 months ago
Linear-time inference in Hierarchical HMMs
The hierarchical hidden Markov model (HHMM) is a generalization of the hidden Markov model (HMM) that models sequences with structure at many length/time scales [FST98]. Unfortuna...
K. P. Murphy, Mark A. Paskin
NIPS
2001
14 years 11 months ago
Generalization Performance of Some Learning Problems in Hilbert Functional Spaces
We investigate the generalization performance of some learning problems in Hilbert functional Spaces. We introduce a notion of convergence of the estimated functional predictor to...
T. Zhang
NIPS
2001
14 years 11 months ago
A General Greedy Approximation Algorithm with Applications
Greedy approximation algorithms have been frequently used to obtain sparse solutions to learning problems. In this paper, we present a general greedy algorithm for solving a class...
T. Zhang
NIPS
2001
14 years 11 months ago
Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning
We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...
Gregory Z. Grudic, Lyle H. Ungar