Fuzzy Logic
Fuzzy Logic
Fuzzy Logic
Report: Cathal Ormond (07617520) Group: Mark Christiansen, Amy Davidson and Cathal Ormond February 1, 2011
Introduction: Overview
The project we decided to do was to create a fuzzy logic system which would model a poker bet. The system takes in four inputs and gives an output which determines the players move. Using this repeatedly, our system was designed to play a game of poker. Initially, it was convenient to assume that there was only one opponent, although it is not too hard to generalise this to multiple opponents. We chose this because there are several fuzzy input which one must consider when considering how to play a hand: what your hand is like, what your opponent bet last hand, whether you believe your opponent is blung or not.
Inputs
We decided on four inputs (linguistic variables) for our fuzzy system: Our Hand, Opponents Hand, The Amount in the Pot and The Ratio of Chips Left.
Our Hand
How good your hand is is given by a value in [0, 1]. In practise, this could not be strictly dened, as every poker player has their own idea of how good a particular hand is. When simulating, Mark (being the best poker player) determined how good the hands were. Our terms for this variable were: Fold, Bad, OK, Good, Great. As can been seen from the diagram, our denition of a good hand is quite conservative.
Opponents Hand
Similar to the above, our opponents hand was also determined by a value in [0, 1]. The problem is that we do not know all the cards our opponent has, only between 3 and 5 of the cards that are out on the table. Hence, our estimate of how good our opponents hand is is slightly more vague than our estimate of our own hand. One can also take into account what the player has bet previously in determining how good our opponents hand is. Our terms for this variable were: Bad, OK, Good. 2
Amount in Pot
We initially assumed that there would be no more than 200 chips in the pot, and that each player would have 100 chips each. This can be modied easily enough to include more opponents, or a higher number of chips each. This value is not fuzzy, and lies in [0, 200].
Chip Ratio
When placing a bet, it is often useful to determine if the chips remaining are to our advantage or to our opponents. If x is the number of chips we have left, and y is the number of chips our opponent has left, then a simple ration is given by Chip Ratio = xy x+y
This value lies in [1, 1]. Thus, if we have no chips left, i.e. x = 0, then the ratio is 1. If we have the same number of chips as our opponent, i.e. x = y , then the ratio is 0. Finally, if our opponent has no chips left, i.e. y = 0, then the ratio is 1. Our terms for this variable were: Opponent Advantage Large, Opponent Advantage Small, Even, Our Advantage Small, Our Advantage Large.
Rules
For our section on rules, we used both common sense and strategy to determine what one should do with the situation presented. For example, if our hand is Bad, and our opponents hand is anything but Bad, then there is no point playing as there is little chance of winning. Thus, our output should be to fold. Similarly, say if our hand is Good, and our opponents hand is not Good, then we should indeed bet, but how much is determined by the chip ratio. If the chip ratio to your advantage and small, make a medium bet. If it is large to your advantage, bet large. Otherwise, bet small. 4
Output
In this project, we chose a Mamdani system. The output was designed not only to tell us what to do but, if betting, how much we should bet. Therefore, we had the following terms: Fold/Check, Small, Medium, Large, All in. Our output was originally designed to be in the range [0, 1], but we allowed some small values outside these bounds, so in practise, the range was [0.2, 1.2]. Anything below 0 means that we should not bet anything, so fold or check if possible. Anything above 1 means that we should go all in. Thus, the values between 0 and 1 represent - roughly - the ratio of chips that we have left that we should bet.
Table 6: Our Output We considered the output based on our opponents hand versus the chip ratio. As you can see in the picture below, the worse your opponents hand is, the higher the bet. Also, once the chip ratio is even, and you opponents hand is anything but great, the bet increases.
Stages
There were four stages in our Mamdani system: fuzzication, inference, composition and defuzzication.
Fuzzication
We took in crisp values for our input and determined by the membership functions which terms they applied to. For example, the following data: Our Hand = 0.55, Opponents Hand = 0.4, Amount in Pot = 20, Chip Ratio = 0.3 would mean that our hand is Good, our opponents hand is OK, there are 20 chips in the pot, and we have a small chip advantage. 7
Inference
When our rules re, we use inference. In the above example, we have a rule stating that when our hand is good, and our opponents hand is not good (i.e. ok) with the ratio of chips is small to our advantage that we should bet medium. In this case, the amount in the pot doesnt actually aect the output.
Composition
When more than one rule res, we may not have a readily available output for our system. We use composition to combine these rules into one fuzzy set.
Defuzzication
Finally, we can defuzzify our system through our inputs to come up with a single output for our system.
Simulation
We tested our system a couple of times: rstly to determine if we were on the right track with our membership functions and then to determine if our system worked. At rst, we considered a professional game of poker online. We chose a particular player and before he made his bet, we estimated the values for our input and had our fuzzy system determine what the players bet should be. We found some aws with our values and that helped us to re-evaluate our rules. Once we thought our system was accurate enough, we all played a game of poker with Mark recording his hand, what he felt our hands were, and what the chip ratio was. We all started with 100 chips and took note of our bets. At the end, we inputted our data into our system to determine what it thought Mark should have done. In almost all cases, the betting made was consistent with our outputs.
Analysis of System
The main problem with our system was that we had no way of dealing with players blung, and the system does not have any method of recommending that we blu. For example, in our rst test, our player had a hand that we determined as fold-worthy and his opponent had a bad hand. Our system gave the output of folding, but our player ended up going all in and winning the hand. Had we spent more time testing the data and adapting the rules, I feel our output would have been more accurate. Also, it would have been a good idea to try a game of poker where our bets were determined by our systems output, instead of checking afterwards if our output matched the bets made.
Conclusion
In the end, we were happy with the output of our system. Our inputs were both fuzzy and crisp, and we had both full and partial information. There were only a few small cases in which we bet too conservatively compared to the system, but to have changed these rules would have meant skewing the entire membership functions. The system was perhaps too conservative for professional poker players and often failed when blung was encountered, but for non-professional games of poker, the system worked wells.