Sciweavers

87
Voted
CORR
2002
Springer
94views Education» more  CORR 2002»
15 years 7 days ago
Self-Optimizing and Pareto-Optimal Policies in General Environments based on Bayes-Mixtures
The problem of making sequential decisions in unknown probabilistic environments is studied. In cycle t action yt results in perception xt and reward rt, where all quantities in g...
Marcus Hutter