Sciweavers

132 search results - page 27 / 27
» Rewarding Behaviors
Sort
View
MANSCI
2007
85views more  MANSCI 2007»
13 years 4 months ago
Probability Elicitation, Scoring Rules, and Competition Among Forecasters
Probability forecasters who are rewarded via a proper scoring rule may care not only about the score, but also about their performance relative to other forecasters. We model this...
Kenneth C. Lichtendahl Jr., Robert L. Winkler
ICML
1996
IEEE
14 years 5 months ago
Learning Evaluation Functions for Large Acyclic Domains
Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...
Justin A. Boyan, Andrew W. Moore