Search Sciweavers | Sciweavers

35 search results - page 6 / 7

» Balancing Multiple Sources of Reward in Reinforcement Learni...

click to vote

AAAI
2011

159views Intelligent Agents» more AAAI 2011»

Learned Behaviors of Multiple Autonomous Agents in Smart Grid Markets

12 years 6 months ago

Download www.cs.cmu.edu

One proposed approach to managing a large complex Smart Grid is through Broker Agents who buy electrical power from distributed producers, and also sell power to consumers, via a ...

Prashant P. Reddy, Manuela M. Veloso

claim paper

Read More »

click to vote

COGSR
2011

71views more COGSR 2011»

Psychological models of human and optimal performance in bandit problems

13 years 1 months ago

Download www.socsci.uci.edu

In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a ﬁxed but unknown rate of reward, to maximize their total number of rewards ov...

Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...

claim paper

Read More »

click to vote

JSAC
2010

107views more JSAC 2010»

Online learning in autonomic multi-hop wireless networks for transmitting mission-critical applications

13 years 4 months ago

Download medianetlab.ee.ucla.edu

Abstract—In this paper, we study how to optimize the transmission decisions of nodes aimed at supporting mission-critical applications, such as surveillance, security monitoring,...

Hsien-Po Shiang, Mihaela van der Schaar

claim paper

Read More »

click to vote

ML
2002
ACM

133views Machine Learning» more ML 2002»

Finite-time Analysis of the Multiarmed Bandit Problem

13 years 5 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

claim paper

Read More »

click to vote

AOIS
2004

171views Intelligent Agents» more AOIS 2004»

Market-Based Recommender Systems: Learning Users' Interests by Quality Classification

13 years 7 months ago

Download eprints.ecs.soton.ac.uk

Recommender systems are widely used to cope with the problem of information overload and, consequently, many recommendation methods have been developed. However, no one technique i...

Yan Zheng Wei, Luc Moreau, Nicholas R. Jennings

claim paper

Read More »

« Prev « First page 6 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers