I tested it with khun poker and it seems to work there at least. I'm asking because I really want to avoid implementing best response for imperfect recall bucketing holdem, because that's ugly.
Imo it totally makes sense that cfrm should converge to a best response if we train only one player. But I hope someone can confirm it? Or am I missing the obvious very easy way to test it, without calculating the real best response within the abstraction?Statistics: Posted by flopnflush — Mon May 12, 2014 8:24 am
]]>