Analysis of a double Poisson model for predicting football results in Euro 2020
File(s)Penn Euro Fig13.tif (3.12 MB) Penn Euro Fig9.tif (1.97 MB)
Supporting information
Supporting information
Author(s)
Penn, Matthew J
Donnelly, Christl
Type
Journal Article
Abstract
First developed in 1982, the double Poisson model, where goals scored by each team are
assumed to be Poisson distributed with a mean depending on attacking and defensive
strengths, remains a popular choice for predicting football scores, despite the multitude
of newer methods that have been developed. This paper examines the pre-tournament
predictions made using this model for the Euro 2020 football tournament. These
predictions won the Royal Statistical Society’s prediction competition, demonstrating
that even this simple model can produce high-quality results. Moreover, the paper also
presents a range of novel analytic results which exactly quantify the conditions for the
existence and uniqueness of the solution to the equations for the model parameters.
After deriving these results, it provides a novel examination of a potential problem with
the model - the over-weighting of the results of weaker teams - and illustrates the
effectiveness of ignoring results against the weakest opposition. It also compares the
predictions with the actual results of Euro 2020, showing that they were extremely
accurate in predicting the number of goals scored. Finally, it considers the choice of
start date for the dataset, and illustrates that the choice made by the authors (which
was to start the dataset just after the previous major international tournament) was
close to optimal, at least in this case. The findings of this study give a better
understanding of the mathematical behaviour of the double Poisson model and provide
evidence for its effectiveness as a match prediction tool.
assumed to be Poisson distributed with a mean depending on attacking and defensive
strengths, remains a popular choice for predicting football scores, despite the multitude
of newer methods that have been developed. This paper examines the pre-tournament
predictions made using this model for the Euro 2020 football tournament. These
predictions won the Royal Statistical Society’s prediction competition, demonstrating
that even this simple model can produce high-quality results. Moreover, the paper also
presents a range of novel analytic results which exactly quantify the conditions for the
existence and uniqueness of the solution to the equations for the model parameters.
After deriving these results, it provides a novel examination of a potential problem with
the model - the over-weighting of the results of weaker teams - and illustrates the
effectiveness of ignoring results against the weakest opposition. It also compares the
predictions with the actual results of Euro 2020, showing that they were extremely
accurate in predicting the number of goals scored. Finally, it considers the choice of
start date for the dataset, and illustrates that the choice made by the authors (which
was to start the dataset just after the previous major international tournament) was
close to optimal, at least in this case. The findings of this study give a better
understanding of the mathematical behaviour of the double Poisson model and provide
evidence for its effectiveness as a match prediction tool.
Date Issued
2022-05-19
Date Acceptance
2022-05-10
Citation
PLoS One, 2022, 17 (5)
ISSN
1932-6203
Publisher
Public Library of Science (PLoS)
Journal / Book Title
PLoS One
Volume
17
Issue
5
Copyright Statement
© 2022 Penn, Donnelly. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
License URL
Sponsor
Medical Research Council (MRC)
Identifier
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0268511
Grant Number
MR/R015600/1
Subjects
Athletic Performance
Football
Soccer
Football
Soccer
Athletic Performance
General Science & Technology
Publication Status
Published