spears wrote:
I'm wondering about multiplying by the winnings/losses at showdown. An error associated with a win is something you want to reward, and an error associated with a loss is something you want to punish.
That wasn't really in the scope of his system though, was it?
I suppose you really aren't interested in whether it's an error that's associated with a win or loss, so much as an error relative to the payoffs of the true model. So if you are against 3 possible hands each with some expected payoff for you: {H1 (50%, -5), H2 (10%, -5), H3 (40%, 0)], and your opponent model predicts incorrectly for H1 and H2, but perfectly for H3, e.g., {H1 (30%, -5), H2 (30%, -5), H3 (40%, 0)] then it doesn't matter at all since H1 and H2 are effectively the same strength. So being even 1% off in H3 is worse than 49% in H1 if the 49% is moved to H2.
All of this assumes you have some payoff function that can look ahead into the future and say, given complete info of the game, this is what you'll win. You could use HS or W% but I think those are going to be pretty poor predictors of the true payoffs of the hand given the opponent actions.
It's an interesting point though. It seems intuitive that you want to be more sensitive to error in certain hands.