Q. Does Winning the Toss Matter? A. Not Really

O.K. so you’ve won the toss by fair means or foul, but in the great statistical stream of things, does that matter?  Answer: probably not.

De Silva and Swartz (1997) started from the position that winning the toss was undoubtedly important in multi-day games, but questioned the validity of that assumption in one-day games.  So their study focused on the results of One Day Internationals (ODI’s). They looked at the results of 427 O.D.I. games played in the 1990’s and looked to see if there was any correlation between winning the game and winning the coin toss.  The eight games of the 427 that ended in a tie were discarded.

Four different statistical methodologies were used to compare the performances of the teams that won and lost the toss, I’ll spare you the details largely because I don’t understand them.  The upshot was that no matter which methodology was used no evidence could be found of the coin toss having an impact on the outcome of ODI games.

Interestingly, despite de Silva and Swartz’s presumption of an advantage from winning the toss in multi-day cricket, Allsopp and Clarke (2004) found no evidence of such an advantage.

“It is established that in test cricket a team’s first-innings batting and bowling strength, first-innings lead, batting order and home advantage are strong predictors of a winning match outcome. Contrary to popular opinion, it is found that the team batting second in a test enjoys a significant advantage. Notably, the relative superiority of teams during the fourth innings of a test match, but not the third innings, is a strong predictor of a winning outcome. There is no evidence to suggest that teams generally gained a winning advantage as a result of winning the toss.”

The other interesting point here is the clear advantage Allsopp and Clarke found in teams batting second in test matches, which gives the lie to Shane Warne’s “win the toss and bat” mantra.  Possible reasons for this ‘bat second’ advantage could be that teams batting second tend to bat on days 2 and 3 when test strips are likely to be at their best, and perhaps modern tracks don’t crumble as much or as often as is commonly supposed.This is what Allsopp (2005) has to say on the matter;

“…the dominance of the team batting second cannot be overestimated, and the results clearly describe an unexpected trend that has emerged in Test cricket. The results strongly indicate that to improve their winning chances teams should expose their particular strength, whether that be batting or bowling in the final rather than the penultimate innings.  This puts paid to the mythical notion…that when given the opportunity, teams should elect to bat first.”

So to summarise, in one day games there is no advantage to winning the toss.  In Test matches there is no demonstrated advantage to winning the toss, but that’s only because captain’s consistently choose the wrong option, i.e to bat first.

o0o

Bibliography

Basil M. de Silva and Tim B. Swartz 1997 ‘Winning the Coin Toss and the Home Advantage in One-Day International Cricket Matches’, The New Zealand Statistician 32: 16-22

Allsopp, P.E. and Clarke, Stephen R. ‘Rating teams and analyzing outcomes in one-day and test cricket’ Journal of the Royal Statistical Society: Series A (Statistics in Society) Volume 167, Issue 4, pages 657–667, November 2004

Allsopp, P.E. ‘Measuring Team Performance and Modelling the Home Advantage Effect in Cricket.  PhD Dissertation Swinburn University of Technology pages 273-274

New Methods For Determining Batting Performance In Short Sequences of Games

Cricket Moneyball Two – assessing batting performance over a relatively short period of time.

During the course of an entire career the conventional statistical methods for determining batting prowess work reasonably well. We can for instance determine that with a career test average (Ave) of 99.94 Don Bradman was a half decent test batsman.

The problems arise when we are assessing player performance over a relatively short period of time, when we do not have a large number of innings to sample.  This can for instance become an issue when we are attempting to determine current short-term form, or a player’s performance in a given tournament.

There are a variety of potential problems here, varying game conditions for instance (more of this in a later post), but chief amongst these issues is the batsman who has a high number of not out scores which can distort his (or her) average. High numbers of not-outs may be down to the batsman’s innate brilliance, blind luck, or their position in the batting order, we cannot tell. This can lead to an erroneously high AVE which is calculated by dividing runs scored by times out (AVE=R/W). The most frequently cited example of this ‘not-out bias’ is the case of Lance Klusener who, in the 1999 World Cup scored 281 runs in nine innings while only being out twice.  This gave Lance an Average of 140.5, despite having a high score of only 52! Clearly a nonsense.

The first attempt to deal with this problem that I can find comes from ‘the two Alans,’ Alan Kimber and Alan Hansford (1) who attempted to draw on earlier work in survival analysis (Cox & Oakes 1984) and reliability analysis (Crowder, Kimber, Smith & Sweeting 1991) to produce a more rational means of batting performance indication.

I am reliably informed that Kimber & Hansford “argue against the geometric distribution and obtain probabilities for selected ranges of individual scores in test cricket using product-limit estimators…” (1)

No, I have no idea what that means either, so you will be relieved to know that others [Durbach (3) and Lemmer (4)] have since demonstrated that this system is almost as unreliable as AVE. So we can forget them and move on.

At this point our old friend H.H. Lemmer comes to our assistance again in (4) & (5) he argues that his analysis showes that if a not-out batsman had been allowed to bat on, he could reasonably expect to score twice the runs that he actually scored.  So logically, if we double the not out scores and count those innings as wickets we have a more accurate assessment, right? Well, not quite. Nothing is quite that simple in the wonderful world of cricket moneyball.

The formula derived by Lemmer from his insight is

e6 = (summout + 2.2-0.01 x avno) X sumno/n

where

n denotes number of innings played

sumout denotes the sum of out scores

sumno denotes the sum of not out scores

avno denotes the average of not out scores

However, if you were to simply double the not out scores and call that innings an ‘out’ you do end up with a very similar figure to e6.

 To put that into Lemmer’s parlance, the formula for this simpler method is

e2 = (sumout + 2 x sumno)/n

as you would expect.

Lemmer himself calls this ‘a good estimator’ and that’s good enough for me, this is the formula that I use for day in day out assessment of batting performance in single day games.

Coming to a spreadsheet near you.

There is one caveat, where there is one single very large not-out score the difference between e2 and e6 can become very large (>10), in which case we can use the measure e26 which is found by:

e26 = (e2 + e6)/2

o0o

Bibliography

1) Kimber, A.C. and Hansford, A.R. (1993) A Statistical analysis of batting in cricket. Journal of the Royal Statistical Society Series A 156 pp 443-455

2) Tim B. Swartz et al, (2006) Optimal Batting Orders in One day Cricket, Computers and Operations Research 33, 1939-1950

 

3) Ian Durbach et al (2007) On a Common Perception of a Random Sequence in Cricket South African Statistical Journal

 

4) Lemmer H.H. (2008) Measures of batting performance in a short series of cricket matches. South African Statistical Journal 42, pp 83-105

5) Lemmer H.H. (2008) An analysis of players’ performance in the first cricket Twenty/20 World Cup series. South African Journal For Research in Sport, Physical Education and Recreation 30 pp71-77