Search Sciweavers | Sciweavers

536 search results - page 51 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

110

click to vote

COLT
1993
Springer

108views Machine Learning» more COLT 1993»

Learning from a Population of Hypotheses

15 years 7 months ago

Download hebb.mit.edu

We introduce a new formal model in which a learning algorithm must combine a collection of potentially poor but statistically independent hypothesis functions in order to approxima...

Michael J. Kearns, H. Sebastian Seung

claim paper

Read More »

128

Voted

NIPS
1996

112views Information Technology» more NIPS 1996»

Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning

15 years 4 months ago

Download www.ri.cmu.edu

Model learning combined with dynamic programming has been shown to be e ective for learning control of continuous state dynamic systems. The simplest method assumes the learned mod...

Jeff G. Schneider

claim paper

Read More »

163

click to vote

TNN
2010

216views Management» more TNN 2010»

Simplifying mixture models through function approximation

14 years 10 months ago

Download books.nips.cc

Finite mixture model is a powerful tool in many statistical learning problems. In this paper, we propose a general, structure-preserving approach to reduce its model complexity, w...

Kai Zhang, James T. Kwok

claim paper

Read More »

121

Voted

ISCAS
2006
IEEE

103views Hardware» more ISCAS 2006»

Towards autonomous adaptive behavior in a bio-inspired CNN-controlled robot

15 years 9 months ago

Download web.mit.edu

— This paper describes a general approach for the unsupervised learning of behaviors in a behavior-based robot. The key idea is to formalize a behavior produced by a Motor Map dr...

Paolo Arena, Luigi Fortuna, Mattia Frasca, Luca Pa...

claim paper

Read More »

171

Voted

JMLR
2012

200views Programming Languages» more JMLR 2012»

Contextual Bandit Learning with Predictable Rewards

13 years 5 months ago

Download www.cs.princeton.edu

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...

claim paper

Read More »

« Prev « First page 51 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers