Thesis icon

Thesis

Efficient large language models for the NHS and psychiatry

Abstract:

Electronic Health Records (EHRs) contain large volumes of unstructured clinical text, produced for a large variety of reasons. The clinical notes within EHR can be full of technical jargon and have a low signal-to-noise ratio. This poses a challenge for healthcare providers like the UK National Health Service (NHS), as making use of these texts to produce insights can be labour-intensive and requires domain expertise. This thesis explores methods for developing efficient, cost-effective, be- spoke Large Language Models (LLMs) to understand NHS clinical text, enabling improved patient management and treatment. An enhanced pretraining regime, using contrastive loss on NHS clinical data, enabled the creation of NHS-specific LLMs within a day on a single GPU. These models outperformed open-source LLMs, facilitating faster adaptation to downstream clinical NLP tasks.

Traditional LLM fine-tuning is computationally expensive and challenging with larger models. Efficient adaptation methods, such as prompt learning, were devel- oped and employed, reducing computational and storage requirements by up to 98% while maintaining state-of-the-art performance on several clinical NLP tasks. The bespoke NHS LLMs and efficient adaptation methods were applied to a digital triage system for secondary mental health referrals. This system aimed to improve transparency, accuracy, and efficiency in routing patients to appropriate care pathways based on their clinical information. The resulting model processed variable-length patient referral text and produced triage team recommendations with an explainability tool to enhance interpretability. Crucially, the triage model remained cost-effective and feasible in resource-constrained environments. This work evaluates the feasibility, utility, and potential benefits of developing specialized LLMs for NHS clinical text processing, discussing implications for enhancing patient care and clinical decision support.

Actions


Access Document


Files:

Authors


More by this author
Institution:
University of Oxford
Division:
MSD
Department:
Psychiatry
Role:
Author

Contributors

Institution:
University of Oxford
Division:
MSD
Department:
Psychiatry
Role:
Supervisor
ORCID:
0000-0001-9276-2720
Institution:
University of Oxford
Division:
MSD
Department:
Psychiatry
Role:
Supervisor


More from this funder
Funder identifier:
https://ror.org/0439y7842
Grant:
EP/S02428X/1
Programme:
Oxford EPSRC Centre for Doctoral Training in Health Data Science


DOI:
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
University of Oxford


Language:
English
Keywords:
Subjects:
Deposit date:
2024-08-22

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP