Generalized Sampling and Variance in Counterfactual Regret Minimization

13 years 2 months ago

Download poker.cs.ualberta.ca

In large extensive form games with imperfect information, Counterfactual Regret Minimization (CFR) is a popular, iterative algorithm for computing approximate Nash equilibria. While the base algorithm performs a full tree traversal on each iteration, Monte Carlo CFR (MCCFR) reduces the per iteration time cost by traversing just a sampled portion of the tree. On the other hand, MCCFR’s sampled values introduce variance, and the effects of this variance were previously unknown. In this paper, we generalize MCCFR by considering any generic estimator of the sought values. We show that any choice of an estimator can be used to probabilistically minimize regret, provided the estimator is bounded and unbiased. In addition, we relate the variance of the estimator to the convergence rate of an algorithm that calculates regret directly from the estimator. We demonstrate the application of our analysis by deﬁning a new bounded, unbiased estimator with empirically lower variance than MCCFR es...

Richard G. Gibson, Marc Lanctot, Neil Burch, Duane

Real-time Traffic

AAAI 2012 | Intelligent Agents | Iterative Algorithm | Sampling Schemes | Tree Traversal |

claim paper

» Regret Bounds for Prediction Problems

» Empirical Bernstein Boosting

» Generalized multicircumcenter trajectories for optimal design under nearindependence

» Breaking the simulation barrier SRAM evaluation through norm minimization

» Vote Elicitation with Probabilistic Preference Models Empirical Estimation and Cost Tradeo...

» Genetic Programming Validation Sets and Parsimony Pressure

Post Info
More Details (n/a)

Added	29 Sep 2012
Updated	29 Sep 2012
Type	Journal
Year	2012
Where	AAAI
Authors	Richard G. Gibson, Marc Lanctot, Neil Burch, Duane Szafron, Michael Bowling

Comments (0)

Sciweavers

Generalized Sampling and Variance in Counterfactual Regret Minimization

AAAI 2012 | Intelligent Agents | Iterative Algorithm | Sampling Schemes | Tree Traversal |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers