Poker-AI.org Poker AI and Botting Discussion Forum 2016-02-13T22:49:23+00:00 http://poker-ai.org/phpbb/feed.php?f=24&t=2902 2016-02-13T22:49:23+00:00 2016-02-13T22:49:23+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2902&p=6915#p6915 <![CDATA[Re: CFR with optimal risk level]]> Statistics: Posted by eiisolver — Sat Feb 13, 2016 10:49 pm


]]>
2016-02-10T18:57:36+00:00 2016-02-10T18:57:36+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2902&p=6912#p6912 <![CDATA[Re: CFR with optimal risk level]]> Statistics: Posted by spears — Wed Feb 10, 2016 6:57 pm


]]>
2016-02-10T09:24:00+00:00 2016-02-10T09:24:00+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2902&p=6911#p6911 <![CDATA[Re: CFR with optimal risk level]]>
I am currently revisiting CFR in a different context where I suspect there is a mean-variance tradeoff and my question again - how to make CFR biased towards actions which would make the whole payoff structure less variable even if resulting in lower EV.

I am now experimenting again in the HU poker context how to achieve this. The way I see there are two extremes:
1) EQ strat which maximises utility irrespective of variance
2) minimum variance strategy which minimizes variance irrespective of utility. Example could be for the first player to always fold thus resulting in a fixed payoff and variance of 0

The first one we can already achieve with the regular CFR, but how to do the 2nd one? I suspect that I either should meddle with the payoffs at the bottom of the tree or the regret updates, but I am slightly puzzled on how to compute variance from a single traversal...

What are your thoughs on this?

Statistics: Posted by DreamInBinary — Wed Feb 10, 2016 9:24 am


]]>
2015-05-02T07:15:20+00:00 2015-05-02T07:15:20+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2902&p=6653#p6653 <![CDATA[Re: CFR with optimal risk level]]>

I have looked into DBR, but with limited success so far and imho DBR is about just EV.

You are right that in investment risk-reward is correlated, but I came to the conclusion poker building on the following argument - tight players experience smaller variance than loose players. Now say you have EQ strategy and you make two equally good adjusted strategies - one loose and one tight. If you play the latters against EQ I would expect that both will underperform, but with difference variance levels.

Now that I think I probably want to just make CFR generate a strategy with adjustable "tightness"


Cheers,
DIB

Statistics: Posted by DreamInBinary — Sat May 02, 2015 7:15 am


]]>
2015-05-01T22:54:04+00:00 2015-05-01T22:54:04+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2902&p=6652#p6652 <![CDATA[Re: CFR with optimal risk level]]> http://poker-ai.org/archive/www.pokerai ... =79&t=4514 is probably quite a good place to start

Statistics: Posted by spears — Fri May 01, 2015 10:54 pm


]]>
2015-05-01T17:11:23+00:00 2015-05-01T17:11:23+00:00 http://poker-ai.org/phpbb/viewtopic.php?t=2902&p=6651#p6651 <![CDATA[CFR with optimal risk level]]>

Afaik CFR goal is to find a strategy which has the highest EV vs its worst-case/BR opponent. I come from an investment background where decisions are made based on risk-reward ratios. I am wondering whether there is a way to adjust CFR so it would exchange part of the EV for a smaller variance ( preferably specifying a tradeoff between std and EV ).

Firstly, I was thinking of replacing the outcome with some risk adjusted version of it - say outcome/standard deviation. The problem is how to track the std and if it makes sense at all.

Supposing that I am able to do some quick and dirty fix on the previous point I started thinking how to test it. The first thing that came to my mind is to play EQ vs Risk adjusted EQ, but then its obv that EQ will win and risk-adjusted will lose :) I could compare it to EQ vs EQ but that somehow feels a bit off.

Having solved these two issues I am planning to dedicate some of my time to it and am willing to share the results!


Cheers,
DIB

PS. I am assuming HUNL here.

Statistics: Posted by DreamInBinary — Fri May 01, 2015 5:11 pm


]]>