Search Sciweavers | Sciweavers

147

ICML
2008
IEEE

154views Machine Learning» more ICML 2008»

Beam sampling for the infinite hidden Markov model

16 years 5 months ago

The infinite hidden Markov model is a nonparametric extension of the widely used hidden Markov model. Our paper introduces a new inference algorithm for the infinite Hidden Markov...

Jurgen Van Gael, Yunus Saatci, Yee Whye Teh, Zoubi...

claim paper

Read More »

152

click to vote

ICML
2006
IEEE

143views Machine Learning» more ICML 2006»

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

16 years 5 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|3 ) to directly solve the Bellman system of |S...

Mauro Maggioni, Sridhar Mahadevan

claim paper

Read More »

133

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

16 years 5 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

160

click to vote

ICML
2001
IEEE

266views Machine Learning» more ICML 2001»

Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

16 years 5 months ago

Download www.cis.upenn.edu

We present conditional random fields, a framework for building probabilistic models to segment and label sequence data. Conditional random fields offer several advantages over hid...

John D. Lafferty, Andrew McCallum, Fernando C. N. ...

claim paper

Read More »

168

click to vote

ICML
2006
IEEE

256views Machine Learning» more ICML 2006»

Automatic basis function construction for approximate dynamic programming and reinforcement learning

15 years 10 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers