Sciweavers

59
Voted
ALT
2006
Springer
15 years 5 months ago
General Discounting Versus Average Reward
Consider an agent interacting with an environment in cycles. In every interaction cycle the agent is rewarded for its performance. We compare the average reward U from cycle 1 to ...
Marcus Hutter