Search Sciweavers | Sciweavers

113 search results - page 22 / 23

» Model Approximation for HEXQ Hierarchical Reinforcement Lear...


ICML
2009
IEEE

141 views · Machine Learning

A stochastic memoizer for sequence data

16 years 8 months ago

Download www.gatsby.ucl.ac.uk

We propose an unbounded-depth, hierarchical, Bayesian nonparametric model for discrete sequence data. This model can be estimated from a single training sequence, yet shares stati...

Frank Wood, Cédric Archambeau, Jan Gasthaus...


ICML
2005
IEEE

121 views · Machine Learning

Combining model-based and instance-based learning for first order regression

16 years 8 months ago

Download www.cs.kuleuven.ac.be

Extended abstract. Kurt Driessens (a), Saso Dzeroski (b). (a) Department of Computer Science, University of Waikato, Hamilton, New Zealand (kurtd@waikato.ac.nz); (b) Departm...

Kurt Driessens, Saso Dzeroski


JMLR
2008

110 views

Cross-Validation Optimization for Large Scale Structured Classification Kernel Methods

15 years 7 months ago

Download jmlr.csail.mit.edu

We propose a highly efficient framework for penalized likelihood kernel methods applied to multiclass models with a large, structured set of classes. As opposed to many previous a...

Matthias W. Seeger


JMLR
2006

124 views

Policy Gradient in Continuous Time

15 years 7 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos


ICML
2008
IEEE

144 views · Machine Learning

An HDP-HMM for systems with state persistence

16 years 8 months ago

Download www.cs.berkeley.edu

The hierarchical Dirichlet process hidden Markov model (HDP-HMM) is a flexible, nonparametric model which allows state spaces of unknown size to be learned from data. We demonstra...

Emily B. Fox, Erik B. Sudderth, Michael I. Jordan,...
