Search Sciweavers | Sciweavers

73 search results - page 15 / 15

» Stochastic Linear Optimization under Bandit Feedback

click to vote

ESANN
2006

140views Neural Networks» more ESANN 2006»

Magnification control for batch neural gas

13 years 6 months ago

Download www2.in.tu-clausthal.de

Neural gas (NG) constitutes a very robust clustering algorithm which can be derived as stochastic gradient descent from a cost function closely connected to the quantization error...

Barbara Hammer, Alexander Hasenfuss, Thomas Villma...

claim paper

Read More »

click to vote

GLOBECOM
2010
IEEE

156views Communications» more GLOBECOM 2010»

Online Network Coding for Time-Division Duplexing

13 years 3 months ago

Download www.mit.edu

We study an online random linear network coding approach for time division duplexing (TDD) channels under Poisson arrivals. We model the system as a bulk-service queue with variabl...

Daniel Enrique Lucani, Muriel Médard, Milic...

claim paper

Read More »

click to vote

JAIR
2008

119views more JAIR 2008»

A Multiagent Reinforcement Learning Algorithm with Non-linear Dynamics

13 years 5 months ago

Download www.ece.utk.edu

Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Due to the complexity of the problem, the majority of the previo...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

« Prev « First page 15 / 15 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers