Journal article
What machine learning teaches us about depression prediction across the life course: An exploratory comparison of predictive models
- Abstract:
- Identifying individuals at risk for depression early is important for preventing long-term mental health issues. However, the variability in depression severity, duration, and triggers complicates predictions. This study explores whether machine learning models can outperform traditional methods, like Logistic Regression, in predicting self-reported depressive symptoms and clinical depression during adolescence and adulthood. We applied five machine learning models with varying complexity levels - Logistic Regression, Decision Tree, XGBoost, Support Vector Machine, and Neural Networks - using data from a nationally representative longitudinal study of the U.S., which tracked participants for 20 years. The models were trained with early-life predictors (ages 12-18) from Wave I, including environmental factors (family, school, health) and genetic predispositions (polygenic scores) from Wave IV. Models were evaluated on their ability to predict depressive symptoms and clinical diagnoses in both adolescence and adulthood. After evaluating the performance of all five models, XGBoost emerged as the most effective, with a 0.02 increase in ROC-AUC compared to the benchmark Logistic Regression model. While this is a slight performance improvement, overall, Logistic Regression performs about as well as many of our ML models. Early-life data showed strong predictive value for depressive symptoms and clinical diagnoses in adolescence and adulthood, highlighting adolescence as a critical period. Polygenic scores do not add predictive power when combined with environmental data. Feature importance analyses identified self-perception and physical health as key predictors of depressive symptoms, while trauma and life-changing events were more influential for clinical depression.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 3.2MB, Terms of use)
-
- Publisher copy:
- 10.1016/j.ssmph.2025.101886
Authors
- Publisher:
- Elsevier
- Journal:
- SSM - Population Health More from this journal
- Volume:
- 32
- Pages:
- 101886
- Publication date:
- 2025-11-19
- Acceptance date:
- 2025-11-18
- DOI:
- EISSN:
-
2352-8273
- ISSN:
-
2352-8273
- Pmid:
-
41399528
- Language:
-
English
- Keywords:
- Pubs id:
-
2347684
- UUID:
-
uuid_0297479a-42e2-4dbc-9fe7-161552eb62eb
- Local pid:
-
pubs:2347684
- Source identifiers:
-
3592895
- Deposit date:
-
2025-12-24
- ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.
Terms of use
- Copyright date:
- 2025
If you are the owner of this record, you can report an update to it here: Report update to this record