I'm having real trouble getting my head around how to calculate a best response to my CFRM generated strategy. I've read the accelerated BR paper and I can almost see how it might be done with PCS, but I'm using plain old CS.
My (probably bad) current understanding is: One player uses the CFRM strategy to make decisions, the other player uses best response strategy to make decisions. Exploitability is profit of best response. So far so good I think. I fall down on how to calculate best response. I know the CFRM strategy should be available to the best response. But in my mind I can't do it without turning the CFRM players hand face-up. (or is that the idea here
)
Can anyone point me in the right direction or explain it in, er, non-greek terms?