Sciweavers

146 search results - page 11 / 30
» Online Gradient Descent Learning Algorithms
JMLR
2012
13 years 8 days ago
Sparse Additive Machine
We develop a high dimensional nonparametric classification method named sparse additive machine (SAM), which can be viewed as a functional version of support vector machine (SVM)...
Tuo Zhao, Han Liu
JMLR
2010
14 years 4 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
ATAL
2004
Springer
15 years 3 months ago
Product Distribution Theory for Control of Multi-Agent Systems
Product Distribution (PD) theory is a new framework for controlling Multi-Agent Systems (MASs). First we review one motivation of PD theory, as the information-theoretic extens...
Chiu Fan Lee, David H. Wolpert
IJACTAICIT
2010
14 years 4 months ago
Prediction Using Recurrent Neural Network Based Fuzzy Inference System by the Modified Bees Algorithm
In this paper, a recurrent neural network based fuzzy inference system (RNFIS) for prediction is proposed. A recurrent network is embedded in the RNFIS by adding feedback connecti...
Zahra Khanmirzaei, Mohammad Teshnehlab
ICML
2007
IEEE
15 years 10 months ago
Exponentiated gradient algorithms for log-linear structured prediction
Conditional log-linear models are a commonly used method for structured prediction. Efficient learning of parameters in these models is therefore an important problem. This paper ...
Amir Globerson, Terry Koo, Xavier Carreras, Michae...