Sciweavers

1863 search results - page 60 / 373
» Multiagent learning using a variable learning rate
Sort
View
ML
2000
ACM
126views Machine Learning» more  ML 2000»
14 years 9 months ago
Learning to Play Chess Using Temporal Differences
In this paper we present TDLEAF( ), a variation on the TD( ) algorithm that enables it to be used in conjunction with game-tree search. We present some experiments in which our che...
Jonathan Baxter, Andrew Tridgell, Lex Weaver
NIPS
2003
14 years 11 months ago
Learning Curves for Stochastic Gradient Descent in Linear Feedforward Networks
Gradient-following learning methods can encounter problems of implementation in many applications, and stochastic variants are frequently used to overcome these difficulties. We ...
Justin Werfel, Xiaohui Xie, H. Sebastian Seung
ATAL
2005
Springer
15 years 3 months ago
Approximating state estimation in multiagent settings using particle filters
State estimation consists of updating an agent’s belief given executed actions and observed evidence to date. In single agent environments, the state estimation can be formalize...
Prashant Doshi, Piotr J. Gmytrasiewicz
GLOBECOM
2008
IEEE
14 years 10 months ago
Autonomous Network Management Using Cooperative Learning for Network-Wide Load Balancing in Heterogeneous Networks
Traditional hop-by-hop dynamic routing makes inefficient use of network resources as it forwards packets along already congested shortest paths while uncongested longer paths may b...
Minsoo Lee, Xiaohui Ye, Dan Marconett, Samuel John...
ECTEL
2010
Springer
14 years 8 months ago
Who Students Interact With? A Social Network Analysis Perspective on the Use of Twitter in Language Learning
Abstract. This paper reports student interaction patterns and self-reported results of using Twitter microblogging environment. The study employs longitudinal probabilistic social ...
Carsten Ullrich, Kerstin Borau, Karen Stepanyan