Sciweavers

35 search results - page 6 / 7
» Balancing Multiple Sources of Reward in Reinforcement Learni...
Sort
View
AAAI
2011
12 years 6 months ago
Learned Behaviors of Multiple Autonomous Agents in Smart Grid Markets
One proposed approach to managing a large complex Smart Grid is through Broker Agents who buy electrical power from distributed producers, and also sell power to consumers, via a ...
Prashant P. Reddy, Manuela M. Veloso
COGSR
2011
71views more  COGSR 2011»
13 years 1 months ago
Psychological models of human and optimal performance in bandit problems
In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a fixed but unknown rate of reward, to maximize their total number of rewards ov...
Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...
JSAC
2010
107views more  JSAC 2010»
13 years 4 months ago
Online learning in autonomic multi-hop wireless networks for transmitting mission-critical applications
Abstract—In this paper, we study how to optimize the transmission decisions of nodes aimed at supporting mission-critical applications, such as surveillance, security monitoring,...
Hsien-Po Shiang, Mihaela van der Schaar
ML
2002
ACM
133views Machine Learning» more  ML 2002»
13 years 5 months ago
Finite-time Analysis of the Multiarmed Bandit Problem
Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...
Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...
AOIS
2004
13 years 7 months ago
Market-Based Recommender Systems: Learning Users' Interests by Quality Classification
Recommender systems are widely used to cope with the problem of information overload and, consequently, many recommendation methods have been developed. However, no one technique i...
Yan Zheng Wei, Luc Moreau, Nicholas R. Jennings