Poker-AI.org
http://poker-ai.org/phpbb/

CFR-D: Solving Imperfect Information Games Using Decompositi
http://poker-ai.org/phpbb/viewtopic.php?f=25&t=2433
Page 1 of 1

Author:  pumpkinpuree [ Sun Mar 31, 2013 2:57 pm ]
Post subject:  CFR-D: Solving Imperfect Information Games Using Decompositi

CFR-D: Solving Imperfect Information Games Using Decomposition
by: Neil Burch, Michael Bowling

Abstract
One of the significant advantages in problems with perfect information, like search or games like checkers, is that they can be decomposed into independent pieces. In contrast, problems with imperfect information, like market modeling or games like poker, are treated as a single decomposable whole. Handling the game as a single unit places a much stricter limit on the size of solvable imperfect information games. This paper has two main contributions. First, we introduce CFR-D, a new variant of the counterfactual regret minimising family of algorithms. For any problem which can be decomposed into a trunk and subproblems, CFR-D can handle the trunk and each subproblem independently. Decomposition lets CFR-D have memory requirements which are sub-linear in the number of decision points, a desirable property more commonly associated with perfect information algorithms. Second, we present an algorithm for recovering an equilibrium strategy in a subproblem given the trunk strategy and some summary information about the subproblem.

http://arxiv.org/pdf/1303.4441v1.pdf

Author:  cantina [ Mon Apr 01, 2013 3:44 pm ]
Post subject:  Re: CFR-D: Solving Imperfect Information Games Using Decompo

I sort-of understand the algorithm. :)

Think they're planning on solving a large, unabstracted trunk, up to maybe the turn (?), then just solving the remaining subtrees online during play at the ACPC?

Author:  LOLWorld [ Fri May 17, 2013 8:46 pm ]
Post subject:  Re: CFR-D: Solving Imperfect Information Games Using Decompo

With online computing of vanilla CFR you need to roll out hole cards for both players even if you know your hole cards.
Is it possible for this algorithm to consider our hole cards and only roll out hole cards for the opponent because of both players playing a best response?

Page 1 of 1 All times are UTC
Powered by phpBB® Forum Software © phpBB Group
http://www.phpbb.com/