Poker-AI.org Poker AI and Botting Discussion Forum 2013-12-02T20:38:23+00:00 http://poker-ai.org/phpbb/feed.php?f=24&t=2645 2013-12-02T20:38:23+00:00 2013-12-02T20:38:23+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2645&p=5385#p5385 <![CDATA[Re: Calculate Exploitability and convergence]]> flopnflush wrote:

You can look at amax code to get an idea:
http://www.poker-ai.org/archive/www.pok ... 335#p40335

If you use perfect recall buckets I would recommend you to start by writing a recursive best response function. You can use precalculated bucket vs bucket ev lookup tables to speed it up. The unabstracted best response can also be calculated recursively, but that might be very slow. Implementing best response within an imperfect recall abstraction is tricky and I haven't done this yet.

Btw the sampling method of your cfrm algorithm doesn't matter. We don't use sampling when we calculate the best response. At least I haven't seen anyone doing this, but it could be possible.


I've got very loose imperfect recall. I'll check the code out anyway.

i've got an idea about building lookup tables to help best response calcs as I do the CFRM recursion. Need to look into it more, cheers.

Statistics: Posted by fraction — Mon Dec 02, 2013 8:38 pm


]]>
2013-12-02T18:43:19+00:00 2013-12-02T18:43:19+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2645&p=5383#p5383 <![CDATA[Re: Calculate Exploitability and convergence]]> http://www.poker-ai.org/archive/www.pok ... 335#p40335

If you use perfect recall buckets I would recommend you to start by writing a recursive best response function. You can use precalculated bucket vs bucket ev lookup tables to speed it up. The unabstracted best response can also be calculated recursively, but that might be very slow. Implementing best response within an imperfect recall abstraction is tricky and I haven't done this yet.

Btw the sampling method of your cfrm algorithm doesn't matter. We don't use sampling when we calculate the best response. At least I haven't seen anyone doing this, but it could be possible.

Statistics: Posted by flopnflush — Mon Dec 02, 2013 6:43 pm


]]>
2013-12-02T17:45:43+00:00 2013-12-02T17:45:43+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2645&p=5380#p5380 <![CDATA[Re: Calculate Exploitability and convergence]]> flopnflush wrote:

What kind of bucketing method do you use. And do you want to find the best response within your full abstraction or the best response within your betting-abstraction but with unabstracted cards?

Cheers for the response.

My bucketing is really simple at the moment. It's just EHS buckets based on pokerstove like rollouts vs random hands and my betting is unabstracted (it's limit). I'd be happy to find out it's best response within it's own abstraction, just to check if it's converging but I'd like to be able to check it's unabstracted best response if possible.

Statistics: Posted by fraction — Mon Dec 02, 2013 5:45 pm


]]>
2013-12-02T17:31:16+00:00 2013-12-02T17:31:16+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2645&p=5379#p5379 <![CDATA[Re: Calculate Exploitability and convergence]]> Statistics: Posted by flopnflush — Mon Dec 02, 2013 5:31 pm


]]>
2013-12-02T15:23:00+00:00 2013-12-02T15:23:00+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2645&p=5375#p5375 <![CDATA[Re: Calculate Exploitability and convergence]]>
My (probably bad) current understanding is: One player uses the CFRM strategy to make decisions, the other player uses best response strategy to make decisions. Exploitability is profit of best response. So far so good I think. I fall down on how to calculate best response. I know the CFRM strategy should be available to the best response. But in my mind I can't do it without turning the CFRM players hand face-up. (or is that the idea here :? )

Can anyone point me in the right direction or explain it in, er, non-greek terms?

Statistics: Posted by fraction — Mon Dec 02, 2013 3:23 pm


]]>
2013-12-02T15:15:17+00:00 2013-12-02T15:15:17+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2645&p=5373#p5373 <![CDATA[Re: Calculate Exploitability and convergence]]>
My (probably bad) current understanding is: One player uses the CFRM strategy to make decisions, the other player uses best response strategy to make decisions. Exploitability is profit of best response. So far so good I think. I fall down on how to calculate best response. I know the CFRM strategy should be available to the best response. But in my mind I can't do it without turning the CFRM players hand face-up.

Can anyone point me in the right direction or explain it in, er, non-greek terms?

Statistics: Posted by fraction — Mon Dec 02, 2013 3:15 pm


]]>
2013-11-18T16:56:46+00:00 2013-11-18T16:56:46+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2645&p=5289#p5289 <![CDATA[Re: Calculate Exploitability and convergence]]> Statistics: Posted by cantina — Mon Nov 18, 2013 4:56 pm


]]>
2013-11-18T15:10:09+00:00 2013-11-18T15:10:09+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2645&p=5288#p5288 <![CDATA[Re: Calculate Exploitability and convergence]]>

Any idea for exploitability ? is it linked to the convergence ? And how should I get it...

Statistics: Posted by MrNice — Mon Nov 18, 2013 3:10 pm


]]>
2013-11-18T13:59:04+00:00 2013-11-18T13:59:04+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2645&p=5287#p5287 <![CDATA[Re: Calculate Exploitability and convergence]]> Statistics: Posted by flopnflush — Mon Nov 18, 2013 1:59 pm


]]>
2013-11-18T11:44:07+00:00 2013-11-18T11:44:07+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2645&p=5286#p5286 <![CDATA[Re: Calculate Exploitability and convergence]]> Statistics: Posted by MrNice — Mon Nov 18, 2013 11:44 am


]]>
2013-11-18T09:27:16+00:00 2013-11-18T09:27:16+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2645&p=5284#p5284 <![CDATA[Re: Calculate Exploitability and convergence]]> Statistics: Posted by flopnflush — Mon Nov 18, 2013 9:27 am


]]>
2013-11-18T08:47:24+00:00 2013-11-18T08:47:24+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2645&p=5283#p5283 <![CDATA[Calculate Exploitability and convergence]]>
I'm working on CFRM-CS for FLHU....

I was wondering at which values should I look to see if my implementation is converging...

And by the way how should I measure the exploitability ... Should I calculate Best Response for both strategies and compare/sum/sub ?

Thanks for your help.

MrNice

Statistics: Posted by MrNice — Mon Nov 18, 2013 8:47 am


]]>