payoff functions | Sciweavers

19

CORR
2008
Springer

136views Education» more CORR 2008»

13 years 4 months ago

In a multi-armed bandit problem, an online algorithm chooses from a set of strategies in a sequence of n trials so as to maximize the total payoff of the chosen strategies. While ...

Robert Kleinberg, Aleksandrs Slivkins, Eli Upfal

claim paper

Read More »

13

click to vote

CEC
2005
IEEE

99views Artificial Intelligence» more CEC 2005»

XCS with computed prediction for the learning of Boolean functions

13 years 10 months ago

Download www.eskimo.com

Computed prediction represents a major shift in learning classiﬁer system research. XCS with computed prediction, based on linear approximators, has been applied so far to functi...

Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...

claim paper

Read More »

7

click to vote

LICS
2007
IEEE

121views Automated Reasoning» more LICS 2007»

Limits of Multi-Discounted Markov Decision Processes

13 years 10 months ago

Download www.labri.fr

Markov decision processes (MDPs) are controllable discrete event systems with stochastic transitions. The payoff received by the controller can be evaluated in different ways, dep...

Hugo Gimbert, Wieslaw Zielonka

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers