Quote:
Well, from what you say it seems to me that you have pretty much figured the problem out in the right terms. I think it might only be a question of fine-tuning for you now.
Yeah, the problem is that the fine-tuning takes the longest to do, as the little things make the biggest difference.
Quote:
I still think it is a pretty complex problem, because especially on the river that decision is the result of the whole process the bot has gone through (assigning a range pre, deciding what part of the range the guy is carrying on with on the flop, how much he folds, how much he hero-calls, what our bluff frequency should be, and so on). This seems so far away for me as we are just starting the project, while you seem to be at a very good point in rule-based decision making.
As a whole it's not really complex (not like MCTS). Getting the range correct pre is the biggest hurdle for me, as it obviously affects the rest of the hand, and if you're wrong it could really cost some big EV. As I mentioned previously, the biggest problem is the bluffing ranges of the villain. Villains can have identical stats but completely different bluff ranges, which has caused me some big headaches. Generally it seems most decent players will 3-bet bluff with hands that can take a call and not just total trash. So suited connectors, blockers, etc.
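To put a number on what a blocker does to a range, here is a minimal Python sketch. The `combos` function and the hand-class notation ('AA', 'AKs', 'AKo') are just assumptions for illustration, not anything taken from an actual bot. It only counts how many concrete two-card combos of a hand class are still possible once some cards are known to be dead.

```python
from itertools import combinations

SUITS = "cdhs"

def combos(hand_class, dead=()):
    """Return the concrete two-card combos for a hand class such as
    'AA', 'AK', 'AKs' or 'AKo', skipping combos that use a dead card.
    (Hypothetical helper, notation assumed for this example.)"""
    dead = set(dead)
    r1, r2 = hand_class[0], hand_class[1]
    kind = hand_class[2:]  # '' = any, 's' = suited, 'o' = offsuit
    if r1 == r2:
        # Pocket pair: choose 2 of the 4 suits.
        cards = [r1 + s for s in SUITS]
        candidates = list(combinations(cards, 2))
    else:
        candidates = [
            (r1 + s1, r2 + s2)
            for s1 in SUITS
            for s2 in SUITS
            if kind == ""
            or (kind == "s" and s1 == s2)
            or (kind == "o" and s1 != s2)
        ]
    # Drop every combo that contains a card we already know about.
    return [c for c in candidates if not dead.intersection(c)]

# Hero holds As Kd, so those two cards cannot be in the villain's hand.
hero = ("As", "Kd")
print(len(combos("AA")), len(combos("AA", hero)))  # 6 3
print(len(combos("AK")), len(combos("AK", hero)))  # 16 9
```

Holding one ace cuts the villain's AA combos in half, which is why hands with blockers make more sensible 3-bet bluffs than total trash.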
Post flop is fairly straightforward and it comes down to basic hand combinatorics and range reduction. So of his range, what % is value, air, draws? And if he is not leading the betting, then what part of his range can he call with, etc. On each street it's going to get tighter and tighter as the range he can bet for value gets smaller and smaller, until you're left with a very small number of hands that he could possibly reach the river with. So on the river you're left with value, bluff catchers and missed draws. Now it's just a matter of running his range vs your hand, and it gets easier to figure out what to do: either bluff catch, bluff or value bet. But it's not just a matter of equity, it's also a matter of hand combinatorics to figure out how to extract max value. Like if his range is mostly missed draws, you would be way better off letting him try to bluff at you vs leading out. You get the idea.
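A rough sketch of that river step in Python, just to show the arithmetic. The category labels, the function names, the example combo counts and the 50% "induce" threshold are all made up for illustration and would need tuning. The only fixed part is the pot-odds formula: calling a bet B into a pot P needs equity of at least B / (P + 2B).

```python
from collections import defaultdict

def breakdown(river_range):
    """river_range: list of (label, combos, category) tuples.
    Returns each category's share of the total combos."""
    totals = defaultdict(float)
    for _label, n_combos, category in river_range:
        totals[category] += n_combos
    total = sum(totals.values())
    return {cat: n / total for cat, n in totals.items()}

def required_equity(pot, bet):
    """Minimum equity needed to call `bet` into `pot` (pot before the bet)."""
    return bet / (pot + 2 * bet)

def should_bluff_catch(value_combos, bluff_combos, pot, bet):
    """Call with a bluff catcher if enough of his betting range is bluffs."""
    bluff_share = bluff_combos / (value_combos + bluff_combos)
    return bluff_share >= required_equity(pot, bet)

def line_with_value_hand(shares, induce_threshold=0.5):
    """Hypothetical rule: if his range is mostly missed draws, check and
    let him bluff; otherwise lead out for value."""
    if shares.get("missed_draw", 0.0) >= induce_threshold:
        return "check to induce"
    return "bet for value"

# Invented example range for the villain on the river.
villain = [
    ("top pair or better", 9, "value"),
    ("second pair", 6, "bluff_catcher"),
    ("busted flush draws", 15, "missed_draw"),
]
shares = breakdown(villain)
print(shares["missed_draw"])                # 0.5
print(required_equity(100, 50))             # 0.25
print(should_bluff_catch(9, 15, 100, 50))   # True
print(line_with_value_hand(shares))         # check to induce
```

In this example half of his range is missed draws, so leading out mostly folds out the hands you beat, while checking gives them the chance to bluff.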