Sciweavers

168 search results - page 30 / 34
» Optimism in Reinforcement Learning Based on Kullback-Leibler...
Sort
View
IJCAI
2003
14 years 10 months ago
An Integrated Multilevel Learning Approach to Multiagent Coalition Formation
In this paper we describe an integrated multilevel learning approach to multiagent coalition formation in a real-time environment. In our domain, agents negotiate to form teams to...
Leen-Kiat Soh, Xin Li
GLOBECOM
2008
IEEE
14 years 9 months ago
Autonomous Network Management Using Cooperative Learning for Network-Wide Load Balancing in Heterogeneous Networks
Traditional hop-by-hop dynamic routing makes inefficient use of network resources as it forwards packets along already congested shortest paths while uncongested longer paths may b...
Minsoo Lee, Xiaohui Ye, Dan Marconett, Samuel John...
ICRA
2010
IEEE
143views Robotics» more  ICRA 2010»
14 years 8 months ago
Apprenticeship learning via soft local homomorphisms
Abstract— We consider the problem of apprenticeship learning when the expert’s demonstration covers only a small part of a large state space. Inverse Reinforcement Learning (IR...
Abdeslam Boularias, Brahim Chaib-draa
ADHOCNETS
2010
Springer
14 years 6 months ago
DCLA: A Duty-Cycle Learning Algorithm for IEEE 802.15.4 Beacon-Enabled WSNs
The current specification for IEEE 802.15.4 beacon-enabled networks does not define how active and sleep schedules should be configured in order to achieve the optimal network perf...
Rodolfo de Paz Alberola, Dirk Pesch
KDD
2010
ACM
289views Data Mining» more  KDD 2010»
14 years 7 months ago
Exploitation and exploration in a performance based contextual advertising system
The dynamic marketplace in online advertising calls for ranking systems that are optimized to consistently promote and capitalize better performing ads. The streaming nature of on...
Wei Li 0010, Xuerui Wang, Ruofei Zhang, Ying Cui, ...