Sciweavers

97 search results - page 6 / 20
» Herding dynamical weights to learn
Sort
View
104
Voted
ATAL
2008
Springer
15 years 3 months ago
Non-linear dynamics in multiagent reinforcement learning algorithms
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Only a subset of these MARL algorithms both do not require agent...
Sherief Abdallah, Victor R. Lesser
ESANN
2008
15 years 3 months ago
Learning Inverse Dynamics: a Comparison
While it is well-known that model can enhance the control performance in terms of precision or energy efficiency, the practical application has often been limited by the complexiti...
Duy Nguyen-Tuong, Jan Peters, Matthias Seeger, Ber...
113
Voted
ML
1998
ACM
136views Machine Learning» more  ML 1998»
15 years 1 months ago
Co-Evolution in the Successful Learning of Backgammon Strategy
Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...
Jordan B. Pollack, Alan D. Blair
WWW
2005
ACM
16 years 2 months ago
Adaptive filtering of advertisements on web pages
We present a browser extension to dynamically learn to filter unwanted images (such as advertisements or flashy graphics) based on minimal user feedback. To do so, we apply the we...
Babak Esfandiari, Richard Nock
INFOCOM
2010
IEEE
15 years 9 days ago
Throughput-Optimal Opportunistic Scheduling in the Presence of Flow-Level Dynamics
Abstract—We consider multiuser scheduling in wireless networks with channel variations and flow-level dynamics. Recently, it has been shown that the MaxWeight algorithm, which i...
Shihuan Liu, Lei Ying, R. Srikant