Poker-AI.org • View topic - The memory dilemma

View unanswered posts | View active topics

Board index » Public Forums » AI Research

All times are UTC

The memory dilemma

Page 1 of 1

[ 4 posts ]

Print view

Previous topic | Next topic

Author

Message

proud2bBot

Post subject: The memory dilemma

Posted: Wed Mar 13, 2013 8:52 pm

Senior Member

Joined: Mon Mar 11, 2013 10:24 pm
Posts: 216

Lets assume we want to solve 25bb NLH HU games using CFRM. Even with this small stack size, we still have to apply tons of abstractions in the following areas to keep the tree size managable:
1 - Holecard-Board abstraction (how do we model our hand [and the board] = size of a node)
2 - Betting abstraction (how many bet sizes and raising depth do we allow = depth of the tree)

I'm currently experimenting with varying in both areas, but facing two major problems:
A) evaluating takes a long time as we need to calculate a model until convergence and play the 2 models against each other
B) the factors may be interdependent (e.g. the effect of allowing more betting options might be minimal if we have few hole card buckets but significant if we have a fine-grained holecard/board abstraction)

Does anyone have an idea how we can evaluate these things better? If not - and you are solving a similar game - how do your abstractions look like?

Mine is as follows:
- 169 preflop buckets and 200 buckets on flop/turn/river based on EHS2
- allowing 4 different bet sizes and all-ins, with maximum of 5 raises per game (over all streets)

I'm changing the card/board abstraction however to a non-EHS2 based one, which should be way better given we have the same number of buckets (at least the first results for the flop indicate it, as now on a board like Td8d5s, hands like Jd9d, AsAh and 8h5h are all in the same bucket, while EHS2 doesn't value monster draws too much).

Top

cantina

Post subject: Re: The memory dilemma

Posted: Wed Mar 13, 2013 11:50 pm

Veteran Member

Joined: Thu Feb 28, 2013 2:39 am
Posts: 437

I've experimented with some unorthodox approaches, like using NNs to 'store' the cumulative regrets. It converged, to a point, but nowhere near anything useful. I had other unorthodox ideas as well, though I'm not as motivated to spend time implementing them. Spears is working on an approach along the same lines, with more success.

As far as my abstractions, I'm working on one where the betting nodes are generated by player data.

Top

proud2bBot

Post subject: Re: The memory dilemma

Posted: Thu Mar 14, 2013 2:10 am

Senior Member

Joined: Mon Mar 11, 2013 10:24 pm
Posts: 216

Nasher wrote:

As far as my abstractions, I'm working on one where the betting nodes are generated by player data.

What do you mean by this? Are you creating nodes based on mined "typical" bet sizes for different histories or do you restrict actions based on an opponent model (i.e., most villains only play x%, so we force them to fold 100-x%)?

Top

cantina

Post subject: Re: The memory dilemma

Posted: Thu Mar 14, 2013 10:34 pm

Veteran Member

Joined: Thu Feb 28, 2013 2:39 am
Posts: 437

I have player "clones" and I'm using the various nodes that they reached through play to build the abstraction. 16 bet sizes in total as % of pot.

Top

Page 1 of 1

[ 4 posts ]

Board index » Public Forums » AI Research

All times are UTC

Who is online

Users browsing this forum: No registered users and 1 guest

You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum