By Elliot Chin
Lately, because the Boston Celtics had been getting ready to historical past, trying to be the primary group to return again from a 3-0 sequence deficit, a pal requested me if streakiness had something to do with the Celtics’ nail-biting Japanese Convention Finals efficiency. This bought me considering: what does it imply for a group to be streaky? And if a group is streaky, will that affect their future play?
These questions are associated to the “sizzling hand” debate, one of many earliest forays of arithmetic into sports activities. Certainly, streakiness has been analyzed in lots of sports activities, however normally on a participant stage. Some outcomes—resembling a contemporary reexamination of the recent hand impact—counsel that streakiness does exist, whereas others point out that momentum may very well have a adverse impact on athletes.
My pal’s remark, nonetheless, had me questioning whether or not streakiness exists not on a participant stage however on a group stage. Had been the Celtics an particularly streaky group who had been in a rut till they lastly hit their stride?
Quantifying Streakiness
Step one to assessing streakiness in NBA groups is to find out an efficient metric for measuring how streaky a group is. Many such metrics have been developed, however I selected to concentrate on two. The primary is autocorrelation, a statistic used steadily for assessing stochastic processes, resembling in finance. Merely put, autocorrelation is calculated by taking the correlation between a dataset and the lagged model of that very same dataset. If we need to assess the streakiness of NBA groups, we might thus take the correlation between a group’s win/loss efficiency throughout video games 1-81 of the season with the group’s win/loss efficiency throughout video games 2-82 of the season. If these two datasets are extremely correlated, it implies that the group is streaky, as a win on one evening implies the next likelihood of a win the following evening.1
The second metric I exploit is known as the runs take a look at. The runs take a look at basically quantifies the variety of “runs” {that a} group goes on, making it a really intuitive metric for measuring streaks. If a group wins 3, loses 4, then wins 3, it has gone on 3 runs. Comparatively, if a group alternates wins and losses over 10 video games, it has gone on 10 separate runs. The existence (or lack thereof) of streaks thus has a large influence on this easy statistic. Within the curiosity of making a rescaled statistic, I exploit a barely modified metric constructed off the runs take a look at that I dub “runs price.”2 The next runs price signifies a extra streaky group.
Naturally, one of the simplest ways to check these metrics can be to use them to the NBA season that we simply witnessed. Proven under are the autocorrelation in addition to normalized runs price for every NBA group for the 2022-23 season. Distributions in addition to normalized win percentages are included for context. Whereas there are a number of noticeable variations between the 2 streakiness metrics developed, there are some clear standouts throughout each graphs. The Rockets had been particularly streaky—maybe inevitable with their abysmal report—whereas the Hawks solely ever had two streaks of at the very least 4 wins or losses. Related to our preliminary query concerning the Japanese Convention Finals, the Celtics are pretty streaky whereas the Warmth are usually extra constant. Particular numerical information is included within the footnotes.3

IS IT JUST RANDOM CHANCE?
Now that now we have some concrete measures of streakiness, we are able to start to investigate what components contribute to or are impacted by a group’s streakiness. Some of the apparent locations to begin is with group energy. Take into account the sting case the place a group wins or loses each sport—the group can be extremely streaky. As such, a place to begin for evaluating these metrics is taking a look at how they examine to group efficiency.
Autocorrelation has no demonstrable relationship to a group’s win share. Groups that win rather a lot, not rather a lot, or someplace within the center all can have excessive or low autocorrelation statistics. Certainly, essentially the most lopsided group by way of wins and losses—the Detroit Pistons—can also be the second least streaky by autocorrelation (maybe as a result of they solely as soon as strung collectively back-to-back wins). On this graph, we additionally start to see that there could be some benefit behind the declare that the Celtics are a streaky group who wanted to—however couldn’t—get sizzling. By autocorrelation, they’re roughly the seventh most streaky group, owing partly to their two 9 sport profitable streaks.

Runs price is a special beast altogether, and is very depending on group efficiency. Within the graph under we witness a U-curve form, as groups that win or lose rather a lot could have many runs of consecutive wins or losses. A Detroit Pistons sport has an over 60% probability of getting the identical consequence because the Pistons’ earlier sport—however that’s solely as a result of they’re prone to lose each video games!
However, groups just like the Milwaukee Bucks and Boston Celtics have lengthy runs of wins and losses—however principally wins—as a result of they win rather a lot, and thus runs are to be anticipated. Runs price thus has a key downside in that it could actually conflate streakiness with being an outlier in uncooked win/loss efficiency. As such, in a while, we’ll discover methods to manage for the influence that group energy can have on streakiness, with a purpose to actually discover which groups are extra (or much less) streaky than anticipated.

In evaluating the usefulness of autocorrelation and runs price, one would possibly ponder whether they maintain worth as predictive measures. The reply is blended, however of their present kind, seemingly no. To check this query, I plotted groups’ efficiency on streakiness metrics through the first and second half of the season.
Autocorrelation, clearly, isn’t self-predictive. In different phrases, a group having excessive autocorrelation and being streaky says little about whether or not they may proceed to be streaky sooner or later; streakiness isn’t a “replicable” statistic.

The story appears to be like—initially—completely different for runs price. Nonetheless, we all know from earlier graphs that runs price is closely influenced by group energy, which can persist over the course of the season. As such, it is sensible to manage for this issue. When subtracting out the anticipated runs price for a given group, calculated through simulation, we arrive at the same consequence as with autocorrelation: little to no relationship between a group’s streakiness within the first and second half of the season. The calculation of anticipated runs price is included within the footnotes.4
This maybe brings us to a solution to my pal’s query. Whereas it might be true that sure groups, with the good thing about hindsight, are extra streaky than others, it’s unlikely {that a} group has some inner high quality (apart from being good or dangerous, that’s) of being particularly streaky that constantly impacts their play. The Celtics, for instance, occurred to be streaky to start with of the season, then had been streaky within the second half of the season, then had been streaky within the playoffs. However given the sheer amount of groups that had a special, extra contradictory relationship with streakiness, it might be an error to confidently assign streakiness to the Celtics when their playoff report might have been as a consequence of random probability


ENOUGH MATH – WHICH TEAMS ARE STREAKY?
The shortage of self-consistency of autocorrelation and runs price doesn’t imply that streakiness isn’t an attention-grabbing metric to look at in any respect. Fairly, we must always take into account it as a descriptive, slightly than a prescriptive, statistic. Much like fielding statistics in baseball, streakiness metrics do a lot better at measuring how streaky a group was than predicting future streakiness.
To this finish, our runs price over expectation metric really gives numerous perception in highlighting groups which will have been considerably roughly streaky than regular. Beneath, now we have the runs price over expectation scores for every of the 30 NBA groups for the 2022-23 season overlaid over a histogram displaying how streaky every group can be anticipated to be. Groups are ordered from most to least streaky.
A pair groups stand out. New York Knicks followers shall be conversant in their sometimes-exciting-sometimes-infuriating penchant for streakiness: they began the season with two losses, then 5 wins, then three losses, then three wins, then three losses, then 9 wins. It wasn’t till February once they had their first stand-alone win.
On the opposite finish of the spectrum are the Atlanta Hawks, who I mentioned earlier on this article. However surprisingly, protecting them firm as probably the most constant strength-adjusted groups is the Miami Warmth. For a group that has made waves for having the ability to “flip it on,” with a star that may “will” his group to victory, the Warmth have had a remarkably unexciting common season.
Conclusion
Streaks in basketball could be essentially the most exhilarating and demoralizing elements of the sport: the underdog getting sizzling or a group blowing a 3-1 lead. Whereas followers would possibly assume there’s something inevitably streaky about their group, I present right here that streaks are principally a product of luck and randomness, slightly than some intrinsic property of a group.
As an alternative of getting used predictively—for instance, to forecast sport 7 of the Japanese Convention Finals—they’re higher used descriptively to grasp the form of a group’s season. Basketball is inherently stochastic and random, however evaluating streakiness can elucidate the order and topology underlying this randomness. The Hawks and Knicks, regardless of the same end-of-season report, took followers and the media upon remarkably completely different journeys all through. Solely by evaluating these groups with how they may have been anticipated to carry out will we see simply how irregular their seasons had been.
Footnotes
1 For half-season autocorrelation calculated later within the article, I exploit the correlation between video games 1-40 and 2-41, and the correlation between video games 42-81 and 43-82. Autocorrelation as a metric may very well be improved if a number of timeframes of lagging the dataset are included (ie. one sport, two video games, and so forth.) however nonetheless a streaky group ought to have greater autocorrelation. As a result of correlation is normalized to the variance in every dataset, autocorrelation has the great property of already being (comparatively) standardized to a group’s total efficiency.
2 The runs price statistic is impressed by an expensive highschool instructor who taught it to me as a manner of evaluating the randomness of stochastic processes. I didn’t know the “Wald–Wolfowitz runs take a look at” was a factor till I started doing analysis for this text and sought a formalization of this technique! Anyhow, the runs price is the variety of video games that comply with a sport of the identical consequence, divided by the full variety of video games performed minus one. It may be considered the possibility {that a} random sport performed (excluding the primary sport performed) had the identical consequence as the sport performed earlier than it. How does this relate to the runs take a look at? Mathematically, in a season with X video games, let’s say the runs take a look at concludes that there are Y runs. On this case, the runs price would equal (X-Y)/(X-1). Normalized runs price refers to runs price minus .5, to place it on the identical scale as autocorrelation and to have a metric the place 0 is common.
3 Particular numeric group outcomes are displayed in a desk right here.
4 Anticipated runs price was calculated because the imply of runs price throughout 10,000 simulations of a permutation take a look at for every group. A group’s report (both 41 or 82 video games, relying on half or full season) was permuted randomly many instances, and the runs price was calculated for every simulation. The common runs price throughout every of those runs was used because the anticipated runs price. Word that this permutation take a look at, whereas making an allowance for group energy and report, doesn’t management for different components which will have an effect on streakiness resembling residence/away benefit. For all groups—however particularly the group I root for, the Golden State Warriors—accounting for this issue would improve anticipated streakiness. Subsequent steps for this challenge might contain a simulation take a look at constructed off of ELO sport predictions with residence/away benefit resembling 538’s mannequin. Betting odds wouldn’t be a smart alternative for a simulation take a look at as they seemingly already take note of any streakiness-based issues.