Research Article
Semi-Parametric Bayesian Estimation of Sparse Multinomial Probabilities with An Application to The Modelling of Bowling Performance in T20I Cricket
Lahiru Wickramasinghe1*, Alexandere Leblanc2 and Saman Muthukumarana2
1Department of Mathematics and Statistics, University of Winnipeg, Winnipeg, Canada
2Department of Statistics, University of Manitoba, Winnipeg, Canada
Lahiru Wickramasinghe, Department of Mathematics and Statistics, University of Winnipeg, Winnipeg, Canada
Received Date:November 30, 2022; Published Date:January 23, 2023
Abstract
We consider modeling bowling performance in Twenty20 international cricket using a semi-parametric Bayesian approach. The bowling performance can be represented as a contingency table and typically yield a sparse contingency table due to cells with small counts and/or zeros. This sparsity is common in Twenty20 international cricket when we have many classification statuses with many levels, even when the sample size is large. Using a Dirichlet process in our proposed model, the multinomial probability vectors are supported on a discrete space, which enables the borrowing of information across data while providing a natural clustering mechanism. Another important feature of the approach is that this borrowing of information also allows the resulting estimators to handle sparsity, a common concern in multinomial data with many categories. The performance of the approach is compared against some of the standard methods available in the literature; James-Stein, empirical Bayes, and Bayesian multinomial regression estimation. To illustrate our modelling strategy, we suggest a simple way to assess the bowling performance of 175 world-class bowlers.
Keywords:James-Stein estimator; Empirical Bayes estimator; Dirichlet process; Multinomial regression; cricket; Sparse data
List of Abbreviations:MLE: Maximum Likelihood Estimator; ML: Maximum Likelihood; JS: James-Stein; EB: Empirical Bayes; MSE: Mean Squared Error; BMR: Bayesian Multinomial Regression; DP: Dirichlet Process; OP: Overall Proportion; ICC: International Cricket Council
-
Lahiru Wickramasinghe*, Alexandere Leblance and Saman Muthukumarana. Semi-Parametric Bayesian Estimation of Sparse Multinomial Probabilities with An Application to The Modelling of Bowling Performance in T20I Cricket. Annal Biostat & Biomed Appli. 5(1): 2023. ABBA.MS.ID.000605.
-
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.