Journal article
Using natural language processing to extract self-harm and suicidality data from a clinical sample of patients with eating disorders: a retrospective cohort study
- Abstract:
- This study aims to improve the accuracy, speed, and safety of suicide risk assessment among adolescents in the digital ecosystems of smart cities. To achieve this goal, an integrated system architecture was developed that combines natural language processing methods, transformer models, and privacy-preserving computation. The methodological part includes large-scale textual data analysis, distributed processing in Apache Spark and Hadoop environments, and the use of federated learning, which allows models to be trained without transferring sensitive source information. The evaluation was conducted on open mental health datasets and supplemented by a series of experiments simulating the system's operation in real time, as well as surveys of specialists: psychologists, educators, and IT experts. The analysis showed that transformer models, particularly BERT, significantly outperform classical algorithms, achieving an AUC-ROC of 0.96 and an F1 score of 0.92 with an average response time of 2.4 seconds. Survey participants noted the importance of transparency and data protection, and the proposed architecture received high marks for reducing the risk of information leaks and providing robust audit mechanisms. The novelty of the work lies in the combination of predictive analytics, federated learning, differential privacy, and blockchain traceability in a single application-oriented system. The results show that ethically sound and rapid suicide risk detection can be implemented in schools, medical institutions, and municipal services, providing practical benefits and contributing to methodological advancements.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Access Document
- Files:
- Version of record (pdf, 539.8KB)
- Publisher copy:
- 10.1136/bmjopen-2021-053808
Authors
- Publisher:
- BMJ Publishing Group
- Journal:
- BMJ Open
- Volume:
- 11
- Issue:
- 12
- Pages:
- e053808-e053808
- Publication date:
- 2021-12-31
- Acceptance date:
- 2021-10-04
- DOI:
- 10.1136/bmjopen-2021-053808
- EISSN:
- 2044-6055
- ISSN:
- 2044-6055
- Language:
- English
- Keywords:
- Pubs id:
- 1233326
- Local pid:
- pubs:1233326
- Source identifiers:
- W4206158111
- Deposit date:
- 2026-04-09
- ARK identifier:
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.
Terms of use
- Copyright date:
- 2021
- Licence:
- Other