Online Monte Carlo Counterfactual Regret Minimization for Search in Imperfect Information Games

8 years 8 days ago

Download www.aamas2015.com

Online search in games has been a core interest of artiﬁcial intelligence. Search in imperfect information games (e.g., Poker, Bridge, Skat) is particularly challenging due to the complexities introduced by hidden information. In this paper, we present Online Outcome Sampling, an online search variant of Monte Carlo Counterfactual Regret Minimization, which preserves its convergence to Nash equilibrium. We show that OOS can overcome the problem of non-locality encountered by previous search algorithms and perform well against its worst-case opponents. We show that exploitability of the strategies played by OOS decreases as the amount of search time increases, and that preexisting Information Set Monte Carlo tree search (ISMCTS) can get more exploitable over time. In head-to-head play, OOS outperforms ISMCTS in games where non-locality plays a signiﬁcant role, given a suﬃcient computation time per move. Categories and Subject Descriptors I.2.1 [Artiﬁcial Intelligence]: Applicat...

Viliam Lisý, Marc Lanctot, Michael H. Bowli

Real-time Traffic

ATAL 2015 | Intelligent Agents |

claim paper

Added	16 Apr 2016
Updated	16 Apr 2016
Type	Journal
Year	2015
Where	ATAL
Authors	Viliam Lisý, Marc Lanctot, Michael H. Bowling

Sciweavers

Online Monte Carlo Counterfactual Regret Minimization for Search in Imperfect Information Games

ATAL 2015 | Intelligent Agents |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers