Search Sciweavers | Sciweavers

945 search results - page 35 / 189

» Dialog Convergence and Learning

IAT
2007
IEEE

92 views, Intelligent Agents

Noise Tolerance in Reinforcement Learning Algorithms

15 years 10 months ago

Download www.ppgia.pucpr.br

This paper proposes a mechanism of noise tolerance for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning algorithms may receive and accumula...

Richardson Ribeiro, Alessandro L. Koerich, Fabrí...

AAAI
2010

171 views, Intelligent Agents

Multi-Agent Learning with Policy Prediction

15 years 5 months ago

Download www.cs.umass.edu

Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...

Chongjie Zhang, Victor R. Lesser

NIPS
2007

162 views, Information Technology

Bundle Methods for Machine Learning

15 years 5 months ago

Download books.nips.cc

We present a globally convergent method for regularized risk minimization problems. Our method applies to Support Vector estimation, regression, Gaussian Processes, and any other ...

Alex J. Smola, S. V. N. Vishwanathan, Quoc V. Le

NIPS
1993

103 views, Information Technology

Optimal Stochastic Search and Adaptive Momentum

15 years 5 months ago

Download www.bme.ogi.edu

Stochastic optimization algorithms typically use learning rate schedules that behave asymptotically as $\mu(t) = \mu_0 / t$. The ensemble dynamics (Leen and Moody, 1993) for such algorithms ...

Todd K. Leen, Genevieve B. Orr

ICML
2007
IEEE

146 views, Machine Learning

Best of both: a hybridized centroid-medoid clustering heuristic

16 years 4 months ago

Download www.machinelearning.org

Although each iteration of the popular kMeans clustering heuristic scales well to larger problem sizes, it often requires an unacceptably-high number of iterations to converge to ...

Nizar Grira, Michael E. Houle
