Poker-AI.org

Poker AI and Botting Discussion Forum
It is currently Mon Nov 13, 2023 5:40 pm

All times are UTC




Post new topic Reply to topic  [ 17 posts ] 
Author Message
PostPosted: Mon Apr 22, 2013 12:22 am 
Offline
Senior Member

Joined: Mon Mar 11, 2013 10:24 pm
Posts: 216
I found a ton of information about GTO bots, but few research has been published about good exploitive or adjustive bots. For instance, I would like to know which techniques bots like Neo, PULPO or MANZANA are employing. Unfortunately, all I could find are "one-liners" about those bots or longer papers about bots that basically failed... Anyone knows how to find more on this topic?


Top
 Profile  
 
PostPosted: Mon Apr 22, 2013 7:25 am 
Offline
Veteran Member

Joined: Thu Feb 28, 2013 2:39 am
Posts: 437
PULPO or MANZANA weren't adaptive.

Neo has/had a website.


Top
 Profile  
 
PostPosted: Mon Apr 22, 2013 9:14 am 
Offline
Senior Member

Joined: Mon Mar 11, 2013 10:24 pm
Posts: 216
I read a post from Johanson in the old forum where they discussed the competition and he argued that so far it was basically a contest of having the best equilibrium, but due to manzana and his dominance in one year he assumed that people would change from pure equilibrium, so I assumed he was at least exploitive. Do you have any infos that he is an equilibrium solution too?

Regarding Neo: I saw the website, but the information there is basically just a few keywords without saying anything :(


Top
 Profile  
 
PostPosted: Mon Apr 22, 2013 12:49 pm 
Offline
Veteran Member

Joined: Thu Feb 28, 2013 2:39 am
Posts: 437
Manzana was an NN trained on actions from EQ(s) then purified. The U of A did testing on it's exploitability and found that it was the most exploitable of all the contestants, despite it's dominance. But, since all the contestants were EQ-based, none of them adapted to Manzana's (obvious) strategy. Pulpo was the predecessor to Manzana -- same concept, same author.

If you really want a challenge, try figuring out how Sonia works/worked. ;) Now offline: pokerparadime.com


Top
 Profile  
 
PostPosted: Mon Apr 22, 2013 11:51 pm 
Offline
Site Admin
User avatar

Joined: Thu Feb 28, 2013 5:24 pm
Posts: 230
Yeah Sonia is really the only one I know that supposedly is highly adaptive and a great player. Not much information out there about it though and as Nasher said they seem to be out of business now too... whatever their "business" was before.

_________________
Cheers.


Top
 Profile  
 
PostPosted: Tue Apr 23, 2013 2:23 am 
Offline
Veteran Member

Joined: Thu Feb 28, 2013 2:39 am
Posts: 437
I (thought) I actually found one of the authors of Sonia on Facebook (Marc Cohen). He didn't respond to my message. :( There's a Marc Cohen that has/had a company in the UK called "All in Equity" (get it?) that declared $30k in revenue at some point. He seems to have dropped out of the poker world. Probably sold his idea for a few million and retired. I'd show you his Google+ profile, but I'm not sure he'd want it posted on a public forum.

I chatted briefly with Spears about Sonia on the old forum, we think she's actually modeling herself from her opponent's perspective, then solving some linear equation between her own (external) range and her opponent's. The authors mentioned she's technically a Bayesian model, but don't go into detail about how she functions. She did beat Polaris, which at the time was the strongest EQ (publicly known).


Top
 Profile  
 
PostPosted: Tue Apr 23, 2013 5:22 am 
Offline
Site Admin
User avatar

Joined: Sun Feb 24, 2013 9:39 pm
Posts: 642
IIRC Polaris 2008 had several "tilted" equilibrium strategies from which it chose the best at runtime according to opponent. http://en.wikipedia.org/wiki/Polaris_(poker_bot)


Top
 Profile  
 
PostPosted: Tue Apr 23, 2013 7:15 am 
Offline
Veteran Member

Joined: Thu Feb 28, 2013 2:39 am
Posts: 437
If I'm not mistaken, I think they only played Orange (the default EQ) versus Sonia.


Top
 Profile  
 
PostPosted: Tue Apr 23, 2013 9:45 am 
Offline
Site Admin
User avatar

Joined: Sun Feb 24, 2013 9:39 pm
Posts: 642
There's not much info about them but I think OneDayItllWork and PeppaPig have successful exploitative bots


Top
 Profile  
 
PostPosted: Tue Apr 23, 2013 12:53 pm 
Offline
Veteran Member

Joined: Thu Feb 28, 2013 2:39 am
Posts: 437
I do too, just not for HUNL. :D


Top
 Profile  
 
PostPosted: Tue Apr 23, 2013 9:49 pm 
Offline
Regular Member
User avatar

Joined: Tue Mar 05, 2013 9:19 pm
Posts: 50
Searching for information on poker agents using Neural Networks/Machine Learning techniques will give you a few results.
Most seem based (very loosely) on learning opponents via history, then simulating the current game state to find the action with the highest EV, then using this action - possibly using a mixed strategy.
However, this almost becomes a task in ML solely as you'll try to aim for the highest % on your classifier for modelling opponents.
I imagine it gets more technical and interesting when trying to adapt to shifting opponents, meeting new opponents and finally attempting to not be an easily exploitable agent.

I'll be working on a Best Response agent next year for my thesis (NL HU or 6-max) - have a basic bot at the moment, but only at around 71% accurate on predicting opponent actions.

I can have a dig around for some links if you'd like? If it's what you're looking for anyhow?


Top
 Profile  
 
PostPosted: Wed Apr 24, 2013 12:40 am 
Offline
Senior Member

Joined: Mon Mar 11, 2013 10:24 pm
Posts: 216
I found a lot of papers, but they lacked in either one of those areas:
1. they either require an exact understanding of opponents private strategy (this would require in depth knowledge we typically don't have)
2. they are only able to adjust after a very long history
3. they are not performing pretty good

If you know a paper that doesn't have these drawbacks, I'd be happy if you can give me a reference. The use case I'm interested in is finding ways to exploit in short-run games like SNGs where we have only a very limited number of observations.


Top
 Profile  
 
PostPosted: Wed Apr 24, 2013 2:01 am 
Offline
Veteran Member

Joined: Thu Feb 28, 2013 2:39 am
Posts: 437
The recent U of A paper on using clones and Exp4? Or did you not want to use a DBR approach?


Top
 Profile  
 
PostPosted: Wed Apr 24, 2013 3:07 am 
Offline
Senior Member

Joined: Mon Mar 11, 2013 10:24 pm
Posts: 216
even though its one of the better papers, the clones could only be created in an articifial setup where they had access to the hands from last years competition (many hands) *and* hole card information was available in each game, even if villain did fold... And the same information is also used (as far as I understood) in the selection process of the best agent. So I think its hard to generalize the findings in a real-world setting.


Top
 Profile  
 
PostPosted: Wed Apr 24, 2013 3:12 am 
Offline
Veteran Member

Joined: Thu Feb 28, 2013 2:39 am
Posts: 437
I don't think the clones were the important aspect of the paper, rather their use of Exp4 and findings that smaller agent profiles perform better. You could replace "clone" with "dbr player" throughout the paper.


Top
 Profile  
 
PostPosted: Wed Apr 24, 2013 6:32 am 
Offline
Senior Member

Joined: Mon Mar 11, 2013 10:24 pm
Posts: 216
maybe I got it wrong. I understood the portfolio thing and that the smaller was better. However, based on which criteria did the exp4 select the bandit to play? as far as i understood, they did EV calcs, but aren't those based on the private information too? For instance, you have 3 models A, B and C. How do you - based on a small history of say 50 hands with 3 hands that went to showdown select the best model?


Top
 Profile  
 
PostPosted: Wed Apr 24, 2013 3:20 pm 
Offline
Veteran Member

Joined: Thu Feb 28, 2013 2:39 am
Posts: 437
Importance Sampling? ... doesn't require knowing the op's strategy.


Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 17 posts ] 

All times are UTC


Who is online

Users browsing this forum: No registered users and 2 guests


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Powered by phpBB® Forum Software © phpBB Group