Sciweavers

1997 search results - page 206 / 400
» On the convergence of Hill's method
Sort
View
WSC
2007
15 years 3 months ago
Subset selection and optimization for selecting binomial systems applied to supersaturated design generation
The problem of finding the binomial population with the highest success probability is considered when the number of binomial populations is large. A new rigorous indifference zo...
Ning Zheng, Theodore Allen
WSC
2008
15 years 3 months ago
Discrete stochastic optimization using linear interpolation
We consider discrete stochastic optimization problems where the objective function can only be estimated by a simulation oracle; the oracle is defined only at the discrete points....
Honggang Wang, Bruce W. Schmeiser
EUC
2008
Springer
15 years 3 months ago
Cooperative Node Localization for Mobile Sensor Networks
In this paper, we propose a range-free cooperative localization algorithm for mobile sensor networks by combining hop distance measurements and particle filtering. In the hop dist...
Hongyang Chen, Marcelo H. T. Martins, Pei Huang, H...
AAAI
2010
15 years 2 months ago
Relative Entropy Policy Search
Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...
Jan Peters, Katharina Mülling, Yasemin Altun
NIPS
2008
15 years 2 months ago
Regularized Policy Iteration
In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...
Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...