Sciweavers

138 search results - page 7 / 28
» Keeping diversity when exploring dynamic environments
Sort
View
COLT
2008
Springer
14 years 12 months ago
Adapting to a Changing Environment: the Brownian Restless Bandits
In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...
Aleksandrs Slivkins, Eli Upfal
NSDI
2010
14 years 11 months ago
Scalable WiFi Media Delivery through Adaptive Broadcasts
Current WiFi Access Points (APs) choose transmission parameters when emitting wireless packets based solely on channel conditions. In this work we explore the benefits of deciding...
Sayandeep Sen, Neel Kamal Madabhushi, Suman Banerj...
ROBOCUP
2001
Springer
96views Robotics» more  ROBOCUP 2001»
15 years 2 months ago
Strategy Learning for a Team in Adversary Environments
Team strategy acquisition is one of the most important issues of multiagent systems, especially in an adversary environment. RoboCup has been providing such an environment for AI a...
Yasutake Takahashi, Takashi Tamura, Minoru Asada
ICML
2007
IEEE
15 years 11 months ago
Percentile optimization in uncertain Markov decision processes with application to efficient exploration
Markov decision processes are an effective tool in modeling decision-making in uncertain dynamic environments. Since the parameters of these models are typically estimated from da...
Erick Delage, Shie Mannor
ICDCS
1997
IEEE
15 years 2 months ago
Load Profiling In Distributed Real-Time Systems
Load balancing is often used to ensure that nodes in a distributed systems are equally loaded. In this paper, we show that for real-time systems, load balancing is not desirable. ...
Azer Bestavros