Search Sciweavers | Sciweavers

945 search results - page 33 / 189

» Dialog Convergence and Learning

COLT
2003
Springer

146 views | Machine Learning

A General Class of No-Regret Learning Algorithms and Game-Theoretic Equilibria

15 years 2 months ago

Download www.cs.berkeley.edu

A general class of no-regret learning algorithms, called no-Φ-regret learning algorithms, is defined which spans the spectrum from no-external-regret learning to no-internal-reg...

Amy R. Greenwald, Amir Jafari

COLT
2000
Springer

87 views | Machine Learning

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 2 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

JMLR
2012

213 views | Programming Languages

Distance Metric Learning with Eigenvalue Optimization

13 years 3 days ago

Download jmlr.csail.mit.edu

The main theme of this paper is to develop a novel eigenvalue optimization framework for learning a Mahalanobis metric. Within this context, we introduce a novel metric learning a...

Yiming Ying, Peng Li

JMLR
2012

250 views | Programming Languages

Online Incremental Feature Learning with Denoising Autoencoders

13 years 3 days ago

Download web.eecs.umich.edu

While determining model complexity is an important problem in machine learning, many feature learning algorithms rely on cross-validation to choose an optimal number of features, ...

Guanyu Zhou, Kihyuk Sohn, Honglak Lee

AAAI
2012

219 views | Intelligent Agents

Transfer Learning with Graph Co-Regularization

13 years 2 days ago

Download www.cs.ust.hk

Transfer learning proves to be effective for leveraging labeled data in the source domain to build an accurate classifier in the target domain. The basic assumption behind transf...

Mingsheng Long, Jianmin Wang, Guiguang Ding, ...
