# In MMA winners keep winning and losers keep losing

23 Aug 2016In MMA, momentum is huge. Fighters who continue to extend their win streaks by dominating opponents can quickly work their way up the tables, awarding them the opportunity to fight tougher opponents, move to higher caliber fight organizations and maybe, ultimately win a title shot. On the flip side, it can be hard for fighters to recover from a loss; often a single loss is often a sign that a fighter’s confidence is shaken and their future performance will suffer.

In this post, I address one aspect of momentum in MMA: how a fighter’s performance in one fight affects their performance in their next fight. Summarizing the careers of over 150,000 fighters, I see that wins at any stage of a fighter’s career are an indicator of future success, while early losses are often enough to force a fighter to rethink their MMA dreams.

Each fighter’s career can be broken down into a simple sequence of wins and losses. For example, the career of Randy “the Natural” Couture can be summarized as: WWWWLLWWWLWWWLLWWLWLWLWWLLWWWL

To investigate how wins or losses impact future performance, instead of viewing Couture’s career in its entirety, we can investigate how a single win or loss impacts performance in the subsequent fight. To do this, I will make the simplifying assumption that careers are memoryless (a Markov process), so a fight’s result will only directly depend upon the fight immediately before it. For example, a fighter’s performance in their third fight ($n+1$) depends upon their second fight ($n$), but not on their first fight ($n-1$).

Adopting this convention, Randy Couture’s career can be described as a sequence of transitions: WW, WW, WW, WL, LL, LW, WW, WW, WL, LW, WW, WW, WL, LL, LW, WW, WL, LW, WL, LW, WL, LW, WW, WL, LL, LW, WW, WW, WL

Rather than just considering a fighter’s career as a sequence of wins and losses, we can also consider one other important milestone in each fighter’s career: their retirement. Couture hasn’t fought since 2011, so we can add one additional state to his career’s trajectory: his ultimate retirement. The final transition in his Markov chain will then be “LR”.

To identify trends in these trajectories, we can generate these Markov career trajectories for all fighters.

## Processing fighter careers

The first step in building Markov career trajectories is to order each fighter’s fights by date, and reducing their career to a sequence of states: wins, losses and if they are not active, retirement (for our purposes, no wins/losses in the last 14 months).

Fighter_link | Result | Fight_number |
---|---|---|

/fighter/Aaron-Harris-60857 | retired | 3 |

/fighter/Aaron-Harrison-87433 | win | 1 |

/fighter/Aaron-Harrison-87433 | retired | 2 |

/fighter/Aaron-Hart-60266 | win | 1 |

/fighter/Aaron-Hart-60266 | loss | 2 |

/fighter/Aaron-Hart-60266 | retired | 3 |

/fighter/Aaron-Hartman-39354 | loss | 1 |

/fighter/Aaron-Hartman-39354 | win | 2 |

/fighter/Aaron-Hartman-39354 | win | 3 |

## Considering fights as a markov process

With each fighter represented as a sequence of wins and losses, possibly ending in retirement, we can reduce these careers into a sequence of Markov chain transitions. This can be easily performed using an inner join of fight# and fight#-1 for each fighter.

from | from_state | to | to_state |
---|---|---|---|

1 | win | 2 | retired |

1 | loss | 2 | retired |

1 | win | 2 | win |

1 | loss | 2 | loss |

1 | win | 2 | loss |

1 | loss | 2 | retired |

1 | loss | 2 | retired |

1 | win | 2 | win |

Patterns such as how frequently wins are followed by losses cannot be accurately tested for most fighters because the average career length of fighters is very short. To circumvent this limitation, we can assume that the identity of individual fighters is not important and we are just interested in sequences of win-loss-retirement transitions. Having ignored individuals, we can pool sequences, for instance, to test how frequently wins are followed by wins, losses or retirement, and how these transitions change with how many fights have elapsed. Even with this pooling, we will still not have much data at very high fight numbers, like the transition between fight 100 to fight 101. When looking at these larger fight numbers, some pooling is done for visualization and to improve accuracy. Interestingly, extremely high fight numbers are almost solely informed by a single fighter, Travis Fulton (the pathological “Fulton zone”). Travis Fulton has had more than 300 professional fights, almost 170 fights more than the second most prolific fighter Shannon Ritch.

from_state | to_state | label | nn | transition_Pr | from_label | to_label |
---|---|---|---|---|---|---|

loss | loss | 1 | 19215 | 0.1450265 | 1-loss | 2-loss |

loss | retired | 1 | 43010 | 0.3246209 | 1-loss | 2-retired |

loss | win | 1 | 14082 | 0.1062849 | 1-loss | 2-win |

win | loss | 1 | 15791 | 0.1191837 | 1-win | 2-loss |

win | retired | 1 | 18792 | 0.1418339 | 1-win | 2-retired |

win | win | 1 | 21603 | 0.1630501 | 1-win | 2-win |

loss | loss | 2 | 11002 | 0.1660579 | 2-loss | 3-loss |

loss | retired | 2 | 12889 | 0.1945392 | 2-loss | 3-retired |

loss | win | 2 | 8634 | 0.1303167 | 2-loss | 3-win |

win | loss | 2 | 10931 | 0.1649863 | 2-win | 3-loss |

## Visualizing transitions between wins and losses across fighter’s careers

Now that we have summarized how frequently wins and losses are followed by either a win, loss, or retirement at the different stages of fighters’ careers, we want to generate some informative summaries. To summarize these transitions, I will present three alternative visualizations that highlight different features of the data.

### Conditional transition probabilities

To determine how the result of one fight (win or loss) affects the next fight (win, loss, or retirement), we can use conditional probabilities, for example how often a fighter wins given that they won their previous fight. Because we are also interested in how the trajectories of a fighter’s career change with experience, we want to visualize how these transition probabilities might change with a fighter’s number of fights.

This fitted scatter plot reveals several important aspects of win-loss transitions:

- If a winning fighter keeps fighting, they are likely to keep winning regardless of where they are in their career (~60% chance).
- If either a very junior (<5 fights) or extremely experienced fighter (>30 fights) loses a bout, they are likely to keep losing; however, fighters with intermediate experience are more likely to rebound from a loss.
- Fighters are very likely to retire within their first few fights; the chance of retirement decreases quickly with experience, and fighters are much more likely to retire after a loss than after a win.

### Transitions on a network

In order to estimate transition probabilities, I assumed that MMA career results could be represented as a Markov process, whereby a fighter’s behavior in a future fight could be fully described by their previous fight’s outcome. From the previous plot, it is not clear that I am describing fights as transitions between distinct states. To convey this point, the quantitative data from the above plot can be projected onto a network diagram generated using igraph and ggraph.

This network-based plot conveys the nature of the full analysis while containing the same data as the scatter plot. (The exact transition probabilities are more clear in the scatter plot, however.) This network visualization suggests that at most experience levels, there are effectively two tracks: wins lead to further wins, and losses lead to further losses, while transitions between these two tracks are less frequent.

### Sankey plot

While the two previous plots effectively summarize the state and transitions of a fighter’s career, they do not convey one important aspect of these states; the frequency of the states. As shown above, most fighters are likely to retire quickly following an early loss, or even a win. Using a Sankey plot (available using the googleVis package), we can convey both the transitions between and the frequency of states.

While the Sankey plot captures the two-track nature of the network plot, it more clearly conveys the role of retirement in shaping the pool of fighters. Since the only states represented in this plot are wins and losses, when a win or loss is followed by retirement the plot will just trail off. Thus, as we progress left-to-right on this plot from junior fighters to experienced fighters, we see a massive increase in white space that reflects the washout of fighters before reaching this milestone.

### Summary

Using a Markov modeling approach, I was able to reveal some interesting aspects of MMA fighters’ careers. Generally, winners keep on winning and losers keep on losing, or they quickly retire. This two track nature of MMA careers suggests that either a fighter carries momentum into their next match (for better or worse) or that many MMA fights are mismatched such that fighters reliably win or lose repeatedly. By applying the Markov assumption we cannot discriminate between the role of momentum and mismatches; however in my next post, I will evaluate the relative influence of these factors.