Sciweavers

Search results for "Learning Rates for Q-Learning" (1227 results, page 99 of 246)
IJCAI
2001
14 years 11 months ago
Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning
Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...
Gregory Z. Grudic, Lyle H. Ungar
NIPS
2000
14 years 11 months ago
Learning Joint Statistical Models for Audio-Visual Fusion and Segregation
People can understand complex auditory and visual information, often using one to disambiguate the other. Automated analysis, even at a low level, faces severe challenges, includin...
John W. Fisher III, Trevor Darrell, William T. Fre...
CLEIEJ
2008
14 years 10 months ago
Postal Envelope Segmentation using Learning-Based Approach
This paper presents a learning-based approach to segment postal address blocks where the learning step uses only one pair of images (a sample image and its ideal segmented solutio...
Horacio Andrés Legal-Ayala, Jacques Facon, ...
FOCM
2008
14 years 10 months ago
Online Gradient Descent Learning Algorithms
This paper considers the least-squares online gradient descent algorithm in a reproducing kernel Hilbert space (RKHS) without explicit regularization. We present a novel capacity i...
Yiming Ying, Massimiliano Pontil
ISCI
2008
14 years 9 months ago
Modified constrained learning algorithms incorporating additional functional constraints into neural networks
In this paper, two modified constrained learning algorithms are proposed to obtain better generalization performance and faster convergence rate. The additional cost terms of the ...
Fei Han, Qing-Hua Ling, De-Shuang Huang