Search Sciweavers | Sciweavers

1863 search results - page 2 / 373

» Multiagent learning using a variable learning rate

click to vote

ICMAS
1998

157views Intelligent Agents» more ICMAS 1998»

The Moving Target Function Problem in Multi-Agent Learning

13 years 6 months ago

Download jmvidal.cse.sc.edu

We describe a framework that can be used to model and predict the behavior of MASs with learning agents. It uses a difference equation for calculating the progression of an agent&...

José M. Vidal, Edmund H. Durfee

claim paper

Read More »

click to vote

ECAI
2008
Springer

124views Artificial Intelligence» more ECAI 2008»

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

13 years 6 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo

claim paper

Read More »

click to vote

AAAI
2008

123views Intelligent Agents» more AAAI 2008»

Bounding the False Discovery Rate in Local Bayesian Network Learning

13 years 7 months ago

Download clopinet.com

Modern Bayesian Network learning algorithms are timeefficient, scalable and produce high-quality models; these algorithms feature prominently in decision support model development...

Ioannis Tsamardinos, Laura E. Brown

claim paper

Read More »

click to vote

ICML
2000
IEEE

169views Machine Learning» more ICML 2000»

Rates of Convergence for Variable Resolution Schemes in Optimal Control

14 years 5 months ago

Download sequel.futurs.inria.fr

This paper presents a general method to derive tight rates of convergence for numerical approximations in optimal control when we consider variable resolution grids. We study the ...

Andrew W. Moore, Rémi Munos

claim paper

Read More »

click to vote

FSS
2006

114views more FSS 2006»

Fuzzy logic based variable step size algorithm for blind delayed source separation

13 years 4 months ago

Download www.ece.uic.edu

Convergence of blind delayed source separation algorithms, which use constant learning rates, is known to be slow. We propose a fuzzy logic based approach to adaptively select the...

Vivek Nigam, Roland Priemer

claim paper

Read More »

« Prev « First page 2 / 373 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers