Search Sciweavers | Sciweavers

124 search results - page 12 / 25

» Apprenticeship learning using linear programming

EMNLP
2010

Turbo Parsers: Dependency Parsing by Approximate Variational Inference

15 years 4 months ago

Download www.cs.cmu.edu

We present a unified view of two state-of-the-art non-projective dependency parsers, both approximate: the loopy belief propagation parser of Smith and Eisner (2008) and the relaxe...

André F. T. Martins, Noah A. Smith, Eric P....

JMLR
2012

A Stick-Breaking Likelihood for Categorical Data Analysis with Latent Gaussian Models

13 years 8 months ago

Download www.cs.ubc.ca

The development of accurate models and efficient algorithms for the analysis of multivariate categorical data are important and longstanding problems in machine learning and compu...

Mohammad Emtiyaz Khan, Shakir Mohamed, Benjamin M....

ICASSP
2011
IEEE

Multiple kernel nonnegative matrix factorization

14 years 9 months ago

Download mirlab.org

Kernel nonnegative matrix factorization (KNMF) is a recent kernel extension of NMF, where matrix factorization is carried out in a reproducing kernel Hilbert space (RKHS) with a f...

Shounan An, Jeong-Min Yun, Seungjin Choi

ICANN
1997
Springer

On Learning Soccer Strategies

15 years 10 months ago

Download igitur-archive.library.uu.nl

We use simulated soccer to study multiagent learning. Each team's players (agents) share action set and policy but may behave differently due to position-dependent inputs. All...

Rafal Salustowicz, Marco Wiering, Jürgen Schm...

ICRA
2009
IEEE

Least absolute policy iteration for robust value function approximation

16 years 23 days ago

Download sugiyama-www.cs.titech.ac.jp

Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...
