Sciweavers

3643 search results - page 171 / 729
» Learning Submodular Functions
Sort
View
NIPS
2001
15 years 7 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
NN
2000
Springer
192views Neural Networks» more  NN 2000»
15 years 6 months ago
A new algorithm for learning in piecewise-linear neural networks
Piecewise-linear (PWL) neural networks are widely known for their amenability to digital implementation. This paper presents a new algorithm for learning in PWL networks consistin...
Emad Gad, Amir F. Atiya, Samir I. Shaheen, Ayman E...
132
Voted
ALT
2010
Springer
15 years 8 months ago
Distribution-Dependent PAC-Bayes Priors
We further develop the idea that the PAC-Bayes prior can be informed by the data-generating distribution. We prove sharp bounds for an existing framework of Gibbs algorithms, and ...
Guy Lever, François Laviolette, John Shawe-...
185
Voted
CORR
2002
Springer
100views Education» more  CORR 2002»
15 years 6 months ago
A neural model for multi-expert architectures
We present a generalization of conventional artificial neural networks that allows for a functional equivalence to multi-expert systems. The new model provides an architectural fr...
Marc Toussaint
175
Voted
ICPR
2000
IEEE
16 years 7 months ago
Image Recognition on the Neural Network Based on Multi-Valued Neurons
Multi-valued neurons are the neural processing elements with complex-valued weights, huge functionality (it is possible to implement on the single neuron arbitrary mapping describ...
Igor N. Aizenberg, Naum N. Aizenberg, Constantine ...