Sciweavers

83 search results - page 1 / 17
» Psychological models of human and optimal performance in ban...
Sort
View
86
Voted
COGSR
2011
71views more  COGSR 2011»
14 years 4 months ago
Psychological models of human and optimal performance in bandit problems
In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a fixed but unknown rate of reward, to maximize their total number of rewards ov...
Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...
81
Voted
ECML
2005
Springer
15 years 3 months ago
Multi-armed Bandit Algorithms and Empirical Evaluation
The multi-armed bandit problem for a gambler is to decide which arm of a K-slot machine to pull to maximize his total reward in a series of trials. Many real-world learning and opt...
Joannès Vermorel, Mehryar Mohri
102
Voted
LION
2010
Springer
190views Optimization» more  LION 2010»
15 years 1 months ago
Algorithm Selection as a Bandit Problem with Unbounded Losses
Abstract. Algorithm selection is typically based on models of algorithm performance learned during a separate offline training sequence, which can be prohibitively expensive. In r...
Matteo Gagliolo, Jürgen Schmidhuber
NIPS
2008
14 years 11 months ago
Structure Learning in Human Sequential Decision-Making
We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...
Daniel Acuña, Paul R. Schrater
IADIS
2003
14 years 11 months ago
How Can Agents Help Improving the Performance of a Human Team
The contribution of intelligent agents for the human team performance is a challenging problem. This paper introduces a study to help clarifying this issue, starting with the moti...
Mauro Nunes, Henrique O'Neill