Search Sciweavers | Sciweavers

3281 search results - page 516 / 657

» Bases for parametrized iterativity

108

click to vote

ICML
2008
IEEE

195views Machine Learning» more ICML 2008»

Estimating local optimums in EM algorithm over Gaussian mixture model

16 years 2 months ago

Download www.comp.nus.edu.sg

EM algorithm is a very popular iteration-based method to estimate the parameters of Gaussian Mixture Model from a large observation set. However, in most cases, EM algorithm is no...

Zhenjie Zhang, Bing Tian Dai, Anthony K. H. Tung

claim paper

Read More »

125

Voted

ICML
2006
IEEE

156views Machine Learning» more ICML 2006»

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

16 years 2 months ago

Download animatlab.lip6.fr

Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

109

click to vote

ICML
2005
IEEE

145views Machine Learning» more ICML 2005»

Proto-value functions: developmental reinforcement learning

16 years 2 months ago

Download www.cs.umass.edu

This paper presents a novel framework called proto-reinforcement learning (PRL), based on a mathematical model of a proto-value function: these are task-independent basis function...

Sridhar Mahadevan

claim paper

Read More »

108

Voted

ICML
2004
IEEE

203views Machine Learning» more ICML 2004»

Training conditional random fields via gradient tree boosting

16 years 2 months ago

Download web.engr.oregonstate.edu

Conditional Random Fields (CRFs; Lafferty, McCallum, & Pereira, 2001) provide a flexible and powerful model for learning to assign labels to elements of sequences in such appl...

Thomas G. Dietterich, Adam Ashenfelter, Yaroslav B...

claim paper

Read More »

126

click to vote

ICML
2004
IEEE

214views Machine Learning» more ICML 2004»

Apprenticeship learning via inverse reinforcement learning

16 years 2 months ago

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

« Prev « First page 516 / 657 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers