Journal article icon

Journal article

Machine learning does not improve humeral torsion prediction compared to regression in baseball pitchers

Abstract:
Background
Humeral torsion is an important osseous adaptation in throwing athletes that can contribute to arm injuries. Currently there are no cheap and easy to use clinical tools to measure humeral torsion, inhibiting clinical assessment. Models with low error and “good” calibration slope may be helpful for prediction.
Hypothesis/Purpose
To develop prediction models using a range of machine learning methods to predict humeral torsion in professional baseball pitchers and compare these models to a previously developed regression-based prediction model.
Study Design
Prospective cohort
Methods
An eleven-year professional baseball cohort was recruited from 2009-2019. Age, arm dominance, injury history, and continent of origin were collected as well as preseason shoulder external and internal rotation, horizontal adduction passive range of motion, and humeral torsion were collected each season. Regression and machine learning models were developed to predict humeral torsion followed by internal validation with 10-fold cross validation. Root mean square error (RMSE), which is reported in degrees (°) and calibration slope (agreement of predicted and actual outcome; best = 1.00) were assessed.
Results
Four hundred and seven pitchers (Age: 23.2 +/-2.4 years, body mass index: 25.1 +/-2.3 km/m2, Left-Handed: 17%) participated. Regression model RMSE was 12° and calibration was 1.00 (95% CI: 0.94, 1.06). Random Forest RMSE was 9° and calibration was 1.33 (95% CI: 1.29, 1.37). Gradient boosting machine RMSE was 9° and calibration was 1.09 (95% CI: 1.04, 1.14). Support vector machine RMSE was 10° and calibration was 1.13 (95% CI: 1.08, 1.18). Artificial neural network RMSE was 15° and calibration was 1.03 (95% CI: 0.97, 1.09).
Conclusion
This is the first study to show that machine learning models do not improve baseball humeral torsion prediction compared to a traditional regression model. While machine learning models demonstrated improved RMSE compared to the regression, the machine learning models displayed poorer calibration compared to regression. Based on these results it is recommended to use a simple equation from a statistical model which can be quickly and efficiently integrated within a clinical setting.
Levels of Evidence
2
Publication status:
Published

Actions

Access Document

Files:
Publisher copy:
10.26603/001c.32380

Authors

More by this author
Institution:
University of Oxford
Division:
MSD
Department:
NDORMS
Research group:
Centre for Sport, Exercise and Osteoarthritis
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MSD
Department:
NDORMS
Sub department:
Botnar Institute for Musculoskeletal Sciences
Research group:
Centre for Sport, Exercise and Osteoarthritis
Role:
Author
ORCID:
0000-0002-2772-2316
More by this author
Institution:
University of Oxford
Division:
MSD
Department:
NDORMS
Sub department:
Botnar Institute for Musculoskeletal Sciences
Research group:
Centre for Sport, Exercise and Osteoarthritis
Oxford college:
Lady Margaret Hall
Role:
Author
ORCID:
0000-0002-3452-3382


Publisher:
International Journal of Sports Physical Therapy
Journal:
International Journal of Sports Physical Therapy More from this journal
Volume:
17
Issue:
3
Pages:
390-399
Place of publication:
United States
Publication date:
2022-04-01
Acceptance date:
2021-12-20
DOI:
EISSN:
2159-2896
ISSN:
2159-2896
Pmid:
35391864


Language:
English
Keywords:
Pubs id:
1250590
Local pid:
pubs:1250590
Deposit date:
2025-03-17
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP