Sciweavers

97 search results - page 6 / 20
» Herding dynamical weights to learn
Sort
View
ATAL
2008
Springer
15 years 1 months ago
Non-linear dynamics in multiagent reinforcement learning algorithms
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Only a subset of these MARL algorithms both do not require agent...
Sherief Abdallah, Victor R. Lesser
ESANN
2008
15 years 1 months ago
Learning Inverse Dynamics: a Comparison
While it is well-known that model can enhance the control performance in terms of precision or energy efficiency, the practical application has often been limited by the complexiti...
Duy Nguyen-Tuong, Jan Peters, Matthias Seeger, Ber...
ML
1998
ACM
136views Machine Learning» more  ML 1998»
14 years 11 months ago
Co-Evolution in the Successful Learning of Backgammon Strategy
Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...
Jordan B. Pollack, Alan D. Blair
WWW
2005
ACM
16 years 12 days ago
Adaptive filtering of advertisements on web pages
We present a browser extension to dynamically learn to filter unwanted images (such as advertisements or flashy graphics) based on minimal user feedback. To do so, we apply the we...
Babak Esfandiari, Richard Nock
78
Voted
INFOCOM
2010
IEEE
14 years 10 months ago
Throughput-Optimal Opportunistic Scheduling in the Presence of Flow-Level Dynamics
Abstract—We consider multiuser scheduling in wireless networks with channel variations and flow-level dynamics. Recently, it has been shown that the MaxWeight algorithm, which i...
Shihuan Liu, Lei Ying, R. Srikant