Poker-AI.org

Poker AI and Botting Discussion Forum

All times are UTC




PostPosted: Mon Sep 06, 2021 5:32 pm 
Junior Member

Joined: Tue Jun 28, 2016 7:12 pm
Posts: 20
.


Last edited by FlashPlayer on Mon Oct 25, 2021 5:14 pm, edited 1 time in total.

PostPosted: Tue Sep 07, 2021 8:10 am 
New Member

Joined: Fri Mar 12, 2021 3:53 pm
Posts: 9
You really should not use vector-form CFR as the base for Deep CFR. Deep CFR runs a batch of external-sampling MCCFR traversals and then a large number (IIRC 32k in the experiments) of stochastic gradient descent steps. Together, that is a single Deep CFR iteration.
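
To make the two phases concrete, here is a minimal sketch of one such iteration. The names `traverse`, `train`, `ReservoirMemory` and `deep_cfr_iteration` are hypothetical placeholders, not code from any existing implementation, and the reservoir-sampling memory is my recollection of how the paper keeps its sample buffer bounded.

```python
import random


class ReservoirMemory:
    """Fixed-capacity sample buffer filled by reservoir sampling (assumed design)."""

    def __init__(self, capacity, rng=None):
        self.capacity = capacity
        self.items = []
        self.seen = 0  # total number of samples offered so far
        self.rng = rng if rng is not None else random.Random()

    def add(self, sample):
        self.seen += 1
        if len(self.items) < self.capacity:
            self.items.append(sample)
        else:
            # Keep the new sample with probability capacity / seen.
            j = self.rng.randrange(self.seen)
            if j < self.capacity:
                self.items[j] = sample


def deep_cfr_iteration(traverse, train, memory, num_traversals, sgd_steps):
    """One iteration: sampled traversals first, then network training.

    traverse() is a hypothetical callable that runs one external-sampling
    traversal and returns the training samples it produced.
    train(samples, sgd_steps) is a hypothetical callable that fits the
    network on the memory contents and returns the trained model.
    """
    # Phase 1: collect samples from external-sampling MCCFR traversals.
    for _ in range(num_traversals):
        for sample in traverse():
            memory.add(sample)
    # Phase 2: many SGD steps on everything currently in memory.
    return train(memory.items, sgd_steps)
```

The point of the split is that phase 2 dominates the cost, so the traversal code in phase 1 is rarely the part worth micro-optimizing.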

"Intel tbb lib, SIMD, hand isomorphism, O(n) showdown calculations and other optimization hacks - all this will be needed in final realization."

If you wanted a really optimized implementation, you would do it all on the GPU (only moving the new samples to the CPU to add them to the memory buffer) with large batched inference calls. So why are you using Intel TBB?
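
As a rough illustration of what "large batched inference calls" means in practice, the sketch below queues individual network queries and answers them with one forward pass per batch. `BatchedEvaluator` and `model_fn` are hypothetical names; `model_fn` stands in for whatever GPU forward pass you use and is assumed to take a list of inputs and return one output per input.

```python
class BatchedEvaluator:
    """Collects single queries and evaluates them in batches (illustrative only)."""

    def __init__(self, model_fn, batch_size):
        self.model_fn = model_fn      # hypothetical: list of inputs -> list of outputs
        self.batch_size = batch_size
        self.pending = []             # queued (features, callback) pairs

    def submit(self, features, callback):
        """Queue one query; callback(output) is called once its batch has run."""
        self.pending.append((features, callback))
        if len(self.pending) >= self.batch_size:
            self.flush()

    def flush(self):
        """Run one forward pass over everything queued so far."""
        if not self.pending:
            return
        batch = [features for features, _ in self.pending]
        callbacks = [callback for _, callback in self.pending]
        self.pending = []
        outputs = self.model_fn(batch)  # a single call for the whole batch
        for callback, output in zip(callbacks, outputs):
            callback(output)
```

Many traversals would call `submit` and a final `flush` handles the leftover partial batch, so the number of forward passes scales with the number of batches rather than the number of queries.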

What is the purpose of hand isomorphism when the network generalizes over the card embeddings?
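
For anyone unsure what hand isomorphism refers to here: it maps hands that differ only by a relabeling of suits to one canonical representative. A brute-force sketch, assuming cards are `(rank, suit)` tuples with suits numbered 0 to 3 (my own encoding; `canonical_hand` is a hypothetical helper, and real implementations use much faster indexing schemes):

```python
from itertools import permutations


def canonical_hand(hole, board):
    """Return a suit-canonical form of (hole, board).

    Tries all 24 suit relabelings and keeps the lexicographically smallest
    result, so any two suit-isomorphic hands map to the same value.
    """
    best = None
    for perm in permutations(range(4)):
        # Relabel suits, then sort within each group so card order is ignored.
        relabeled_hole = tuple(sorted((rank, perm[suit]) for rank, suit in hole))
        relabeled_board = tuple(sorted((rank, perm[suit]) for rank, suit in board))
        candidate = (relabeled_hole, relabeled_board)
        if best is None or candidate < best:
            best = candidate
    return best
```

With a tabular strategy this shrinks the number of distinct information sets; with a network over card embeddings the benefit is much less clear, which is the question being asked above.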

O(n) showdown calculations are not possible if you are sampling chance (as is done in external sampling), because each traversal then works with a single sampled deal rather than full ranges.
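
For context, the O(n) showdown trick applies to range-vs-range evaluation as used in vector-form CFR. A simplified sketch, assuming both ranges are already sorted by hand strength and ignoring card removal (blockers), which a real implementation has to correct for; `showdown_values` is a hypothetical name:

```python
def showdown_values(hero_strengths, opp_strengths, opp_reach):
    """Expected showdown result for every hero hand in one linear sweep.

    hero_strengths and opp_strengths must be sorted ascending.
    opp_reach[j] is the opponent's reach probability for hand j.
    Payoff is +1 for a win, -1 for a loss, 0 for a tie.
    Assumption: card removal between the two ranges is ignored.
    """
    total = sum(opp_reach)
    n = len(opp_strengths)
    values = []
    lo = 0                 # opponent hands before index lo are strictly weaker
    hi = 0                 # opponent hands before index hi are weaker or equal
    weaker = 0.0           # reach mass of strictly weaker opponent hands
    weaker_or_equal = 0.0  # reach mass of weaker or tied opponent hands
    for strength in hero_strengths:
        while lo < n and opp_strengths[lo] < strength:
            weaker += opp_reach[lo]
            lo += 1
        while hi < n and opp_strengths[hi] <= strength:
            weaker_or_equal += opp_reach[hi]
            hi += 1
        stronger = total - weaker_or_equal
        values.append(weaker - stronger)
    return values
```

Both pointers only move forward, so the sweep is linear in the combined range sizes instead of the quadratic cost of comparing every pair of hands. With a single sampled deal per traversal there is no range to sweep over, so the trick has nothing to speed up.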

Anyway, the real bottleneck is training the neural network each iteration. You may as well just use Python for everything and not worry too much about optimizing the MCCFR iterations.

Training will also be very expensive, gl

