Journal article
Machine learning does not improve humeral torsion prediction compared to regression in baseball pitchers
- Abstract:
-
Background
Humeral torsion is an important osseous adaptation in throwing athletes that can contribute to arm injuries. Currently, there are no inexpensive, easy-to-use clinical tools to measure humeral torsion, which inhibits clinical assessment. Models with low error and a “good” calibration slope may be helpful for prediction.
Hypothesis/Purpose
To develop prediction models using a range of machine learning methods to predict humeral torsion in professional baseball pitchers and compare these models to a previously developed regression-based prediction model.
Study Design
Prospective cohort
Methods
An eleven-year professional baseball cohort was recruited from 2009-2019. Age, arm dominance, injury history, and continent of origin were recorded, and preseason shoulder external and internal rotation, horizontal adduction passive range of motion, and humeral torsion were collected each season. Regression and machine learning models were developed to predict humeral torsion, followed by internal validation with 10-fold cross-validation. Root mean square error (RMSE), reported in degrees (°), and calibration slope (agreement of predicted and actual outcome; best = 1.00) were assessed.
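The two performance metrics named in the Methods can be illustrated with a minimal sketch. The abstract does not state how the calibration slope was estimated, so this example assumes the common approach of an ordinary least-squares fit of observed on predicted values. The function names and the torsion values are hypothetical and are not study data.

```python
import math


def rmse(observed, predicted):
    """Root mean square error, in the same units as the outcome (degrees here)."""
    squared_errors = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def calibration_slope(observed, predicted):
    """Slope of a least-squares line fitting observed values on predicted values.

    Assumption: this is one common definition for continuous outcomes; the
    source does not specify its exact estimation procedure.
    """
    n = len(observed)
    mean_obs = sum(observed) / n
    mean_pred = sum(predicted) / n
    covariance = sum(
        (p - mean_pred) * (o - mean_obs) for o, p in zip(observed, predicted)
    )
    variance = sum((p - mean_pred) ** 2 for p in predicted)
    return covariance / variance


# Hypothetical humeral torsion values in degrees (illustrative only).
observed = [10.0, 20.0, 30.0, 40.0]
predicted = [12.0, 18.0, 33.0, 37.0]

print("RMSE (degrees):", rmse(observed, predicted))
print("Calibration slope:", calibration_slope(observed, predicted))
```

Under this definition, a slope of 1.00 means the spread of the predictions matches the spread of the observed values, which is why a model can have a lower RMSE and still be judged less well calibrated.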
Results
Four hundred and seven pitchers (age: 23.2 ± 2.4 years; body mass index: 25.1 ± 2.3 kg/m²; left-handed: 17%) participated. Regression model RMSE was 12° and calibration slope was 1.00 (95% CI: 0.94, 1.06). Random forest RMSE was 9° and calibration slope was 1.33 (95% CI: 1.29, 1.37). Gradient boosting machine RMSE was 9° and calibration slope was 1.09 (95% CI: 1.04, 1.14). Support vector machine RMSE was 10° and calibration slope was 1.13 (95% CI: 1.08, 1.18). Artificial neural network RMSE was 15° and calibration slope was 1.03 (95% CI: 0.97, 1.09).
Conclusion
This is the first study to show that machine learning models do not improve humeral torsion prediction in baseball pitchers compared to a traditional regression model. While machine learning models demonstrated improved RMSE compared to the regression model, they displayed poorer calibration. Based on these results, a simple equation from a statistical model, which can be quickly and efficiently integrated within a clinical setting, is recommended.
Levels of Evidence
2
- Publication status:
- Published
Access Document
- Publisher copy:
- 10.26603/001c.32380
Authors
- Publisher:
- International Journal of Sports Physical Therapy
- Journal:
- International Journal of Sports Physical Therapy
- Volume:
- 17
- Issue:
- 3
- Pages:
- 390-399
- Place of publication:
- United States
- Publication date:
- 2022-04-01
- Acceptance date:
- 2021-12-20
- DOI:
- 10.26603/001c.32380
- EISSN:
-
2159-2896
- ISSN:
-
2159-2896
- Pmid:
-
35391864
- Language:
-
English
- Keywords:
- Pubs id:
-
1250590
- Local pid:
-
pubs:1250590
- Deposit date:
-
2025-03-17
- ARK identifier:
Terms of use
- Copyright holder:
- Bullock et al
- Copyright date:
- 2022
- Rights statement:
- © 2022 The Authors. This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). View this license’s legal deed at https://creativecommons.org/licenses/by-nc/4.0 and legal code at https://creativecommons.org/licenses/by-nc/4.0/legalcode for more information.