Who was it said “There are three kinds of lies. Lies, damn lies and statistics”? We asked one hundred people. 37% said “Benjamin Disraeli,” 55% said “Mark Twain” and 42% said “I don’t give a toss!” Half Man Half Biscuit said “If 8 out of 10 cats do prefer Whiskers do the other two prefer Lesley Judd?” John McCormick having set off on a train of thought about whether or not Arsene Wenger uses statistics to help him to decide whether or not to sell players (he doesn’t need stats to tell him Chamakh was a waste of money) has been considering the place of the numbers game in football.
After M. Salut posted some comments about Ibrahimovic, regular Salut contributor, Jeremy Robson and I engaged in a debate, some of which concerned the use of statistics in making footballing decisions. We’re probably not as far apart as that debate suggests but I thought I’d put pen to paper (actually fingers to keyboard but we oldies are bound by metaphor ) and give you my thinking about the use of statistics in football.
I must say, though, that I’m not really a stats person. I don’t download or analyse figures. I rarely even read the Opta stats for SAFC when they are posted on the ALS website. Nor do I want to get into semantics over the differences between information, data and statistics, or even discuss how many Sessegnons can dance on the head of pin (*the answer to that is at the bottom, by the way). I just think, properly used, hard data has an important role to play in the modern football club.
I’ll use a recent trip to Southport to outline some of my themes and then I’ll move on to football scenarios to add a bit of depth:
Last Thursday, when it was sunny, I strolled down to the station and caught a train to Southport, which is a pleasant ride up the coast. I spent the day in Southport and got back about 7.00pm.
Did I have a good time? What do you think?
There’s actually nothing to tell you what the day was like but because of the way I presented (some may say slanted, or biased) the information you might be inclined to think I did have a good time. Even so, you might be reluctant to make a firm decision. So here’s another bit of information:
Last Thursday I spent 7 hours in the Accident and Emergency unit of Southport General Hospital.
Now what do you think?
It’s probable that your view has changed. The second piece of information is presented without any slant but most people’s experiences of A&E units aren’t good and that personal experience will affect judgements. I think you will now be of the opinion that I didn’t have a good day.
However, there are other possibilities to consider. I might be a retired health professional called in to cover absence, or a lay minister volunteering to work with casualties. It’s possible I went home feeling happy and fulfilled.
But the truth is that myself, my friends and my family have no association with Southport General in any professional or volunteer capacity.
So what do you think now? You probably will be more confident that I didn’t have a good day. You could be wrong but by considering all of the information, discarding any which is irrelevant, taking account of bias (in the information or in yourself) and, above all, asking the right questions to get more information, you are likely to reach a much better decision than if you used gut feeling, guesswork or a coin toss.
My argument is that the same applies to football. There’s a lot of information out there in the form of hard facts. If clubs can identify the questions to ask and find the data which provides the answers they might just gain an advantage. The key is to ask the right questions and use the right data. I think from what Jeremy said that he believes clubs don’t always do this. I have to agree with him, and if clubs don’t ask the right questions or make decisions using the relevant data their decisions are indeed meaningless.
Let’s consider this in the context of a hypothetical situation. If Opta stats, which for the purpose of argument we will assume are based on enough data (i.e. enough games played) to be valid, show Phil McBardsley made fewer tackles in 2012-13 than he did in 2011-12, and fewer tackles in 2013-14 than in 2012-13 should the club sell him?
Such data might indicate McBardsley is slowing down and needs to be moved on. However, there are other possibilities. Could it be that he has gained experience and intercepts passes before he needs to tackle, or that someone further up the pitch is making life difficult for the opposition and the ball doesn’t come as deep? If so, it could be a mistake to get rid of him. Or could changes be happening because of MON’s coaching. In this case the data might be the only sure way of telling that it’s working.
On its own the data about tackles is not enough for any decision to be made, but it does flag up something that needs investigation. More questions need to be asked and only when these are all answered might the manager be in a position to make a decision. Clubs are increasingly turning to data to help them answer questions and I think this is a force for the good. Done properly, this must be better than working on intuition and gut feeling. It isn’t all down to the readily available Opta stats, though. Clubs are very private in the data they collect and what they do with it.
Here’s a real-life situation: Match of the day showed a game against Stoke, who were playing Delap, where a defender conceded a corner rather than a throw in. Motty’s comment was something like “That’s one way of preventing a long throw in”. It was, but was it good play?
If this was just a one- off event during a game then that’s OK. It’s nothing more than a talking point. But what if a coach had just watched a “Football Focus” sequence which featured half a dozen goals coming from Delap’s long throws? There’s something called the “availability heuristic” that states information recently received can stay in the memory and influence judgements. The coach might have thought long throws were more dangerous than really was the case and given his players the wrong advice.
To make the best decision the coach would need to know the probabilities of Stoke scoring from a long throw and from a corner. That’s not as simple as it seems because you have to factor in things like goals directly arising from the kick or throw, add in goals from what rugby would call the “second phase” and take account of other effects, such as sending offs or penalties.
But such calculations can be made, and they will be valid as long as enough data is collected. If someone had carried out such an analysis and it showed that a long throw was more dangerous than a corner, then conceding a corner could be a smart tactic. But there are always other considerations. How does your own team handle corners? How does this compare with their handling of balls played in from the wings, which might be the best data you can get to help you focus on long throws? Then what about the effects of weather, sunlight, boggy ground, pitch slope, ice, wind, rain? And don’t forget players and their foibles. It’s a complex situation but in setting up for any game asking the right questions and getting the correct answers could give your team the edge.
The same applies in setting up for any season. You have a limited squad. How will you cope with the speed of Bale, the aggression of Rooney , the experience of Sir Alex, the hostility of ASDA in the derby. What data will you use to maximise your chances of a successful season when you select your squad? Our medal winning cyclists evaluate everything they think might be relevant and operate on what they call the aggregation of marginal gains. Football should do the same.
So here’s a question for you. MON is setting up for 2012-13 and that includes moving players in and out. Should he keep Cattermole or get rid? This site and others such as ALS show it’s an emotive area for fans. That’s fine but do we want MON to behave emotionally? Should he just rely on his judgement, what he sees in training and what his scouts tell him? Or are there statistics, data, or information (take your pick) which might help him make an optimal decision. If so, what are they and where are they? That’s the issue clubs need to get to grips with.
(*Only one Sessegnon can dance on the head of a pin, ‘cos there’s only one Sessegnon)