Poker-AI.org Poker AI and Botting Discussion Forum 2023-01-16T20:40:43+00:00 http://poker-ai.org/phpbb/feed.php?f=24&t=3541 2023-01-16T20:40:43+00:00 2023-01-16T20:40:43+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=3541&p=9775#p9775 <![CDATA[Re: Possible ways to approach a 9-handed poker AI]]> Statistics: Posted by DavyBlaid — Mon Jan 16, 2023 8:40 pm


]]>
2022-05-07T13:08:36+00:00 2022-05-07T13:08:36+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=3541&p=9081#p9081 <![CDATA[Re: Possible ways to approach a 9-handed poker AI]]> Statistics: Posted by zggcd — Sat May 07, 2022 1:08 pm


]]>
2022-04-24T15:56:17+00:00 2022-04-24T15:56:17+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=3541&p=9071#p9071 <![CDATA[Re: Possible ways to approach a 9-handed poker AI]]> Statistics: Posted by listerofsmeg — Sun Apr 24, 2022 3:56 pm


]]>
2022-04-17T18:19:25+00:00 2022-04-17T18:19:25+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=3541&p=9066#p9066 <![CDATA[Re: Possible ways to approach a 9-handed poker AI]]>
Create a strategy with a bunch of parameters, let's say a 1000. The strategy would be designed in a way that some choice of parameters would result in a strategy that is "good", i.e. not very exploitable.

For example, there could be a bunch of rules such as - on the flop in position, value bet hands with p_1 % equity against opponent's range and bluff p_2 % of our range.

But the question is, how to optimize the parameters, ideally via some CFR-like algorithm?

Statistics: Posted by listerofsmeg — Sun Apr 17, 2022 6:19 pm


]]>
2022-04-09T19:36:16+00:00 2022-04-09T19:36:16+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=3541&p=9064#p9064 <![CDATA[Possible ways to approach a 9-handed poker AI]]>
It should be able to play 6-max and 9-max tables, shouldn't require a lot of computation to act and it would be nice if it could be adjusted in some way to exploit the environment.

What is a good approach?

I've been following the poker research for years but this still seems like a somewhat uncharted territory. The state-of-the art seems to be Pluribus, but it's very complex, it's 6-max, requires fixed starting stacks and uses a lot of computation resources. And the near-GTO strategy may not even be appropriate in a highly exploitable setting.

Right now I'm thinking about this, although it's a very vague idea - create some sort of a parametrized strategy template with hundreds or thousands of parameters and optimize the parameters via self-play.

What do you think?

Statistics: Posted by listerofsmeg — Sat Apr 09, 2022 7:36 pm


]]>