Sciweavers

101 search results - page 1 / 21
ICML
2001
IEEE
14 years 5 months ago
Convergence of Gradient Dynamics with a Variable Learning Rate
As multiagent environments become more prevalent, we need to understand how this changes the agent-based paradigm. One aspect that is heavily affected by the presence of multiple a...
Michael H. Bowling, Manuela M. Veloso
GLOBECOM
2007
IEEE
13 years 11 months ago
A Generalized Gradient Scheduling Algorithm in Wireless Networks for Variable Rate Transmission
Average transmission rate and rate oscillation are two important performance metrics for most wireless services. Both often need to be optimized in multi-user scheduling ...
Xiaolu Zhang, Meixia Tao, Chun Sum Ng
CDC
2010
IEEE
12 years 12 months ago
Convergence and convergence rate of stochastic gradient search in the case of multiple and non-isolated extrema
The asymptotic behavior of stochastic gradient algorithms is studied. Relying on some results of differential geometry (Lojasiewicz gradient inequality), the almost sure pointconve...
Vladislav B. Tadic
NIPS
2001
13 years 6 months ago
Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning
We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...
Gregory Z. Grudic, Lyle H. Ungar
IJIT
2004
13 years 6 months ago
Improving the Convergence of the Backpropagation Algorithm Using Local Adaptive Techniques
Since the presentation of the backpropagation algorithm, a vast variety of improvements of the technique for training feed-forward neural networks have been proposed. This articl...
Z. Zainuddin, N. Mahat, Y. Abu Hassan