Poker-AI.org

Poker AI and Botting Discussion Forum
It is currently Mon Nov 13, 2023 5:34 pm

All times are UTC




Post new topic Reply to topic  [ 12 posts ] 
Author Message
PostPosted: Mon Nov 18, 2013 8:47 am 
Offline
Junior Member

Joined: Wed Sep 04, 2013 6:05 pm
Posts: 47
Hi Guyz,

I'm working on CFRM-CS for FLHU....

I was wondering at which values should I look to see if my implementation is converging...

And by the way how should I measure the exploitability ... Should I calculate Best Response for both strategies and compare/sum/sub ?

Thanks for your help.

MrNice


Top
 Profile  
 
PostPosted: Mon Nov 18, 2013 9:27 am 
Offline
Junior Member

Joined: Sat Nov 02, 2013 2:21 pm
Posts: 26
Afair the sum of the best responses should converge to zero.


Top
 Profile  
 
PostPosted: Mon Nov 18, 2013 11:44 am 
Offline
Junior Member

Joined: Wed Sep 04, 2013 6:05 pm
Posts: 47
Meaning that the implementation is converging right ?


Top
 Profile  
 
PostPosted: Mon Nov 18, 2013 1:59 pm 
Offline
Junior Member

Joined: Sat Nov 02, 2013 2:21 pm
Posts: 26
yes!


Top
 Profile  
 
PostPosted: Mon Nov 18, 2013 3:10 pm 
Offline
Junior Member

Joined: Wed Sep 04, 2013 6:05 pm
Posts: 47
oki thanks :D

Any idea for exploitability ? is it linked to the convergence ? And how should I get it...


Top
 Profile  
 
PostPosted: Mon Nov 18, 2013 4:56 pm 
Offline
Veteran Member

Joined: Thu Feb 28, 2013 2:39 am
Posts: 437
I think exploitability is the sum of the best responses. I'd be curious to see a heuristic that estimates this faster.


Top
 Profile  
 
PostPosted: Mon Dec 02, 2013 3:15 pm 
Offline
Junior Member

Joined: Mon Dec 02, 2013 3:02 pm
Posts: 19
I'm having real trouble getting my head around how to calculate a best response to my CFRM generated strategy. I've read the accelerated BR paper and I can see how it might be done with PCS, but I'm using plain old CS.

My (probably bad) current understanding is: One player uses the CFRM strategy to make decisions, the other player uses best response strategy to make decisions. Exploitability is profit of best response. So far so good I think. I fall down on how to calculate best response. I know the CFRM strategy should be available to the best response. But in my mind I can't do it without turning the CFRM players hand face-up.

Can anyone point me in the right direction or explain it in, er, non-greek terms?


Top
 Profile  
 
PostPosted: Mon Dec 02, 2013 3:23 pm 
Offline
Junior Member

Joined: Mon Dec 02, 2013 3:02 pm
Posts: 19
I'm having real trouble getting my head around how to calculate a best response to my CFRM generated strategy. I've read the accelerated BR paper and I can almost see how it might be done with PCS, but I'm using plain old CS.

My (probably bad) current understanding is: One player uses the CFRM strategy to make decisions, the other player uses best response strategy to make decisions. Exploitability is profit of best response. So far so good I think. I fall down on how to calculate best response. I know the CFRM strategy should be available to the best response. But in my mind I can't do it without turning the CFRM players hand face-up. (or is that the idea here :? )

Can anyone point me in the right direction or explain it in, er, non-greek terms?


Top
 Profile  
 
PostPosted: Mon Dec 02, 2013 5:31 pm 
Offline
Junior Member

Joined: Sat Nov 02, 2013 2:21 pm
Posts: 26
What kind of bucketing method do you use. And do you want to find the best response within your full abstraction or the best response within your betting-abstraction but with unabstracted cards?


Top
 Profile  
 
PostPosted: Mon Dec 02, 2013 5:45 pm 
Offline
Junior Member

Joined: Mon Dec 02, 2013 3:02 pm
Posts: 19
flopnflush wrote:
What kind of bucketing method do you use. And do you want to find the best response within your full abstraction or the best response within your betting-abstraction but with unabstracted cards?

Cheers for the response.

My bucketing is really simple at the moment. It's just EHS buckets based on pokerstove like rollouts vs random hands and my betting is unabstracted (it's limit). I'd be happy to find out it's best response within it's own abstraction, just to check if it's converging but I'd like to be able to check it's unabstracted best response if possible.


Top
 Profile  
 
PostPosted: Mon Dec 02, 2013 6:43 pm 
Offline
Junior Member

Joined: Sat Nov 02, 2013 2:21 pm
Posts: 26
You can look at amax code to get an idea:
http://www.poker-ai.org/archive/www.pok ... 335#p40335

If you use perfect recall buckets I would recommend you to start by writing a recursive best response function. You can use precalculated bucket vs bucket ev lookup tables to speed it up. The unabstracted best response can also be calculated recursively, but that might be very slow. Implementing best response within an imperfect recall abstraction is tricky and I haven't done this yet.

Btw the sampling method of your cfrm algorithm doesn't matter. We don't use sampling when we calculate the best response. At least I haven't seen anyone doing this, but it could be possible.


Top
 Profile  
 
PostPosted: Mon Dec 02, 2013 8:38 pm 
Offline
Junior Member

Joined: Mon Dec 02, 2013 3:02 pm
Posts: 19
flopnflush wrote:
You can look at amax code to get an idea:
http://www.poker-ai.org/archive/www.pok ... 335#p40335

If you use perfect recall buckets I would recommend you to start by writing a recursive best response function. You can use precalculated bucket vs bucket ev lookup tables to speed it up. The unabstracted best response can also be calculated recursively, but that might be very slow. Implementing best response within an imperfect recall abstraction is tricky and I haven't done this yet.

Btw the sampling method of your cfrm algorithm doesn't matter. We don't use sampling when we calculate the best response. At least I haven't seen anyone doing this, but it could be possible.


I've got very loose imperfect recall. I'll check the code out anyway.

i've got an idea about building lookup tables to help best response calcs as I do the CFRM recursion. Need to look into it more, cheers.


Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 12 posts ] 

All times are UTC


Who is online

Users browsing this forum: Google [Bot] and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Powered by phpBB® Forum Software © phpBB Group