Search Sciweavers | Sciweavers

945 search results - page 50 / 189

» Dialog Convergence and Learning

146

click to vote

COLT
2008
Springer

103views Machine Learning» more COLT 2008»

More Efficient Internal-Regret-Minimizing Algorithms

15 years 6 months ago

Download www.cs.brown.edu

Standard no-internal-regret (NIR) algorithms compute a fixed point of a matrix, and hence typically require O(n3 ) run time per round of learning, where n is the dimensionality of...

Amy R. Greenwald, Zheng Li, Warren Schudy

claim paper

Read More »

132

Voted

ICML
2008
IEEE

122views Machine Learning» more ICML 2008»

Reinforcement learning in the presence of rare events

16 years 5 months ago

Download www.ece.mcgill.ca

We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...

Jordan Frank, Shie Mannor, Doina Precup

claim paper

Read More »

145

click to vote

ISCIS
2003
Springer

107views Information Technology» more ISCIS 2003»

A New Continuous Action-Set Learning Automaton for Function Optimization

15 years 9 months ago

Download ce.sharif.edu

In this paper, we study an adaptive random search method based on continuous action-set learning automaton for solving stochastic optimization problems in which only the noisecorr...

Hamid Beigy, Mohammad Reza Meybodi

claim paper

Read More »

117

click to vote

NIPS
1996

98views Information Technology» more NIPS 1996»

Second-order Learning Algorithm with Squared Penalty Term

15 years 5 months ago

Download www.kecl.ntt.co.jp

This paper compares three penalty terms with respect to the efficiency of supervised learning, by using first- and second-order learning algorithms. Our experiments showed that fo...

Kazumi Saito, Ryohei Nakano

claim paper

Read More »

125

click to vote

TSMC
1998

97views more TSMC 1998»

Parallel algorithms for modules of learning automata

15 years 4 months ago

Download eprints.iisc.ernet.in

— Parallel algorithms are presented for modules of learning automata with the objective of improving their speed of convergence without compromising accuracy. A general procedure...

M. A. L. Thathachar, M. T. Arvind

claim paper

Read More »

« Prev « First page 50 / 189 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers