Poker-AI.org

Poker AI and Botting Discussion Forum
It is currently Mon Nov 13, 2023 2:18 pm

All times are UTC




Post new topic Reply to topic  [ 2 posts ] 
Author Message
PostPosted: Tue Dec 10, 2013 5:17 am 
Offline
Junior Member

Joined: Thu May 23, 2013 11:35 pm
Posts: 23
Hi poker-ai

I tried to modify the monte carlo cfrm algorithm for kuhn poker by not sampling randomly but rather iterating QvK 1M times, followed by QvA 1M times .. and so on for every pair of hands. It didn't converge to the Nash equilibrium. Does this mean the sampling method has to be "random looking" in some way?
I then tried iterating over each holecard pair but perfectly interspersed. i.e. (QK,QA,KQ,KA,AQ,AK)x1000000
and it did converge to the nash equilibrium.

For real poker would it be OK to iterate in this way?
for mycards = 22,23,... AA
for oppcards = 22,23,... AA
for flop = 222,223,... AAA
....

it seems it might not converge as well since the p1 hole card are not interspersed perfectly. though I do not know why it does not converge perfectly in that situation. Anyone have an idea ?

The reason I want to do this is to get perfectly weighted representations for each scenario instead of using a PRNG with no guarantees.

algonoob


Top
 Profile  
 
PostPosted: Tue Dec 10, 2013 7:04 am 
Offline
Site Admin
User avatar

Joined: Sun Feb 24, 2013 9:39 pm
Posts: 642
This is difficult to put into words. In each iteration of Fictitious Play one player plays his best response to the other's equilibrium response, and then the players change places. The best response strategies are averaged over all time to form the equilibrium response. So if the same cards are played in successive iterations as you describe it's like you have very few iterations so convergence is very slow. CFRM is similar.

Quote:
For real poker would it be OK to iterate in this way?
for mycards = 22,23,... AA
for oppcards = 22,23,... AA
for flop = 222,223,... AAA


I'm 99% sure this would converge slower than a random ordering.

Quote:
The reason I want to do this is to get perfectly weighted representations for each scenario instead of using a PRNG with no guarantees.

I suspect that for a "perfectly weighted representation" you would need the answer before you start. But I think you might be able to bias the sampling to the cards which matter more to the solution. I know no more than that.


Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 2 posts ] 

All times are UTC


Who is online

Users browsing this forum: No registered users and 2 guests


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Powered by phpBB® Forum Software © phpBB Group