Sciweavers

5075 search results - page 189 / 1015
» Convergence
Sort
View
NIPS
2007
15 years 6 months ago
Incremental Natural Actor-Critic Algorithms
We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...
Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...
136
Voted
WCE
2007
15 years 5 months ago
Step-Size Bounds Analysis of the Generalized Multidelay Adaptive Filter
—In this paper, we analyze the bounds of the fixed common step-size parameter GMDFµ for the generalized multidelay adaptive filter (GMDF). Frequency domain adaptive filters are ...
Junghsi Lee, Hsu Chang Huang
AUTOMATICA
2008
71views more  AUTOMATICA 2008»
15 years 4 months ago
Performance of convergence-based variable-gain control of optical storage drives
In this paper, a method for the performance assessment of a variable-gain control design for optical storage drives is proposed. The variablegain strategy is used to overcome well...
Nathan van de Wouw, H. A. Pastink, Marcel F. Heert...
145
Voted
CORR
2010
Springer
119views Education» more  CORR 2010»
15 years 4 months ago
Dynamic Policy Programming
In this paper, we consider the problem of planning and learning in the infinite-horizon discounted-reward Markov decision problems. We propose a novel iterative direct policysearc...
Mohammad Gheshlaghi Azar, Hilbert J. Kappen
CORR
2008
Springer
128views Education» more  CORR 2008»
15 years 4 months ago
Distributed Consensus over Wireless Sensor Networks Affected by Multipath Fading
The design of sensor networks capable of reaching a consensus on a globally optimal decision test, without the need for a fusion center, is a problem that has received considerable...
Gesualdo Scutari, Sergio Barbarossa