Ordinal versus nominal regression models and the problem of correctly predicting draws in soccer
Peer reviewed, Journal article
Published version

View/ Open
Date
2017Metadata
Show full item recordCollections
- Artikler [419]
- Publikasjoner fra Cristin [436]
Original version
International Journal of Computer Science in Sport. 2017, 16 (1), 50-64. 10.1515/ijcss-2017-0004Abstract
Ordinal regression models are frequently used in academic literature to model outcomes of soccer matches, and seem to be preferred over nominal models. One reason is that, obviously, there is a natural hierarchy of outcomes, with victory being preferred to a draw and a draw being preferred to a loss. However, the often used ordinal models have an assumption of proportional odds: the influence of an independent variable on the log odds is the same for each outcome. This paper illustrates how ordinal regression models therefore fail to fully utilize independent variables that contain information about the likelihood of matches ending in a draw. However, in practice, this flaw does not seem to have a substantial effect on the predictive accuracy of an ordered logit regression model when compared to a multinomial logistic regression model. Keywords: association football, forecasting, ordered regression