Search Sciweavers | Sciweavers

417 search results - page 23 / 84

» Reinforcement Learning Estimation of Distribution Algorithm

100

click to vote

IWINAC
2007
Springer

130views Artificial Intelligence» more IWINAC 2007»

EDNA: Estimation of Dependency Networks Algorithm

15 years 8 months ago

Download www.cs.bham.ac.uk

One of the key points in Estimation of Distribution Algorithms (EDAs) is the learning of the probabilistic graphical model used to guide the search: the richer the model the more ...

José A. Gámez, Juan L. Mateo, Jose M...

claim paper

Read More »

115

click to vote

TNN
1998

111views more TNN 1998»

Asymptotic distributions associated to Oja's learning equation for neural networks

15 years 1 months ago

Download www-public.int-evry.fr

— In this paper, we perform a complete asymptotic performance analysis of the stochastic approximation algorithm (denoted subspace network learning algorithm) derived from Oja’...

Jean Pierre Delmas, Jean-Francois Cardos

claim paper

Read More »

120

click to vote

AGI
2008

136views Artificial Intelligence» more AGI 2008»

An Integrative Methodology for Teaching Embodied Non-Linguistic Agents, Applied to Virtual Animals in Second Life

15 years 3 months ago

Download www.novamente.net

A teaching methodology called Imitative-Reinforcement-Corrective (IRC) learning is described, and proposed as a general approach for teaching embodied non-linguistic AGI systems. I...

Ben Goertzel, Cassio Pennachin, Nil Geisweiller, M...

claim paper

Read More »

129

click to vote

CVPR
2011
IEEE

311views Computer Vision» more CVPR 2011»

Distributed Computer Vision Algorithms Through Distributed Averaging

14 years 10 months ago

Download www.cis.jhu.edu

Traditional computer vision and machine learning algorithms have been largely studied in a centralized setting, where all the processing is performed at a single central location....

Roberto Tron, René, Vidal

claim paper

Read More »

161

click to vote

JMLR
2012

200views Programming Languages» more JMLR 2012»

Contextual Bandit Learning with Predictable Rewards

13 years 4 months ago

Download www.cs.princeton.edu

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...

claim paper

Read More »

« Prev « First page 23 / 84 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers