Sciweavers

945 search results - page 50 / 189
» Dialog Convergence and Learning
Sort
View
COLT
2008
Springer
14 years 11 months ago
More Efficient Internal-Regret-Minimizing Algorithms
Standard no-internal-regret (NIR) algorithms compute a fixed point of a matrix, and hence typically require O(n3 ) run time per round of learning, where n is the dimensionality of...
Amy R. Greenwald, Zheng Li, Warren Schudy
ICML
2008
IEEE
15 years 10 months ago
Reinforcement learning in the presence of rare events
We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...
Jordan Frank, Shie Mannor, Doina Precup
ISCIS
2003
Springer
15 years 2 months ago
A New Continuous Action-Set Learning Automaton for Function Optimization
In this paper, we study an adaptive random search method based on continuous action-set learning automaton for solving stochastic optimization problems in which only the noisecorr...
Hamid Beigy, Mohammad Reza Meybodi
NIPS
1996
14 years 11 months ago
Second-order Learning Algorithm with Squared Penalty Term
This paper compares three penalty terms with respect to the efficiency of supervised learning, by using first- and second-order learning algorithms. Our experiments showed that fo...
Kazumi Saito, Ryohei Nakano
TSMC
1998
97views more  TSMC 1998»
14 years 9 months ago
Parallel algorithms for modules of learning automata
— Parallel algorithms are presented for modules of learning automata with the objective of improving their speed of convergence without compromising accuracy. A general procedure...
M. A. L. Thathachar, M. T. Arvind