Search Sciweavers | Sciweavers

» Exploiting Myopic Learning

ECCV
2010
Springer

Computer Vision

Object of Interest Detection by Saliency Learning

15 years 4 months ago

Download users.cecs.anu.edu.au

In this paper, we present a method for object of interest detection. This method is statistical in nature and hinges on a model which combines salient features using a mixture of l...

AAMAS
2005
Springer

Intelligent Agents

Coordinating Multiple Agents via Reinforcement Learning

15 years 2 months ago

Download www.icis.ntu.edu.sg

In this paper, we focus on the coordination issues in a multiagent setting. Two coordination algorithms based on reinforcement learning are presented and theoretically analyzed. O...

Gang Chen, Zhonghua Yang, Hao He, Kiah Mok Goh

ML
2002
ACM

Machine Learning

Near-Optimal Reinforcement Learning in Polynomial Time

15 years 2 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

CVPR
2010
IEEE

Computer Vision

Safety in Numbers: Learning Categories from Few Examples with Multi Model Knowledge Transfer

15 years 11 months ago

Download www.idiap.ch

Learning object categories from small samples is a challenging problem, where machine learning tools can in general provide very few guarantees. Exploiting prior knowledge may be ...

Tatiana Tommasi, Francesco Orabona, Barbara Caputo

IJCNN
2006
IEEE

Neural Networks

On derivation of stagewise second-order backpropagation by invariant imbedding for multi-stage neural-network learning

15 years 9 months ago

Download www.ieor.berkeley.edu

We present a simple, intuitive argument based on “invariant imbedding” in the spirit of dynamic programming to derive a stagewise second-order backpropagation (BP) algorith...

Eiji Mizutani, Stuart Dreyfus
