Sciweavers

1455 search results - page 67 / 291
» Exploiting Myopic Learning
Sort
View
ECCV
2010
Springer
15 years 1 months ago
Object of Interest Detection by Saliency Learning
In this paper, we present a method for object of interest detection. This method is statistical in nature and hinges in a model which combines salient features using a mixture of l...
AAMAS
2005
Springer
14 years 11 months ago
Coordinating Multiple Agents via Reinforcement Learning
In this paper, we focus on the coordination issues in a multiagent setting. Two coordination algorithms based on reinforcement learning are presented and theoretically analyzed. O...
Gang Chen, Zhonghua Yang, Hao He, Kiah Mok Goh
ML
2002
ACM
121views Machine Learning» more  ML 2002»
14 years 11 months ago
Near-Optimal Reinforcement Learning in Polynomial Time
We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...
Michael J. Kearns, Satinder P. Singh
CVPR
2010
IEEE
15 years 8 months ago
Safety in Numbers: Learning Categories from Few Examples with Multi Model Knowledge Transfer
Learning object categories from small samples is a challenging problem, where machine learning tools can in general provide very few guarantees. Exploiting prior knowledge may be ...
Tatiana Tommasi, Francesco Orabona, Barbara Caputo
IJCNN
2006
IEEE
15 years 5 months ago
On derivation of stagewise second-order backpropagation by invariant imbedding for multi-stage neural-network learning
— We present a simple, intuitive argument based on “invariant imbedding” in the spirit of dynamic programming to derive a stagewise second-order backpropagation (BP) algorith...
Eiji Mizutani, Stuart Dreyfus