Sciweavers

58 search results - page 12 / 12
» Fuzzy Approximation for Convergent Model-Based Reinforcement...
TSMC
2008
Decentralized Learning in Markov Games
Learning Automata (LA) were recently shown to be valuable tools for designing Multi-Agent Reinforcement Learning algorithms. One of the principal contributions of LA theory is tha...
Peter Vrancx, Katja Verbeeck, Ann Nowé
IJON
2007
Convex incremental extreme learning machine
Unlike the conventional neural network theories and implementations, Huang et al. [Universal approximation using incremental constructive feedforward networks with random hidden n...
Guang-Bin Huang, Lei Chen
JMLR
2006
Policy Gradient in Continuous Time
Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...
Rémi Munos