Journal article icon

Journal article

Named entity recognition in electronic health records using transfer learning bootstrapped neural networks

Abstract:
Neural networks (NNs) have become the state of the art in many machine learning applications, such as image, sound (LeCun et al., 2015) and natural language processing (Young et al., 2017; Linggard et al., 2012). However, the success of NNs remains dependent on the availability of large labelled datasets, such as in the case of electronic health records (EHRs). With scarce data, NNs are unlikely to be able to extract this hidden information with practical accuracy. In this study, we develop an approach that solves these problems for named entity recognition, obtaining 94.6 F1 score in I2B2 2009 Medical Extraction Challenge (Uzuner et al., 2010), 4.3 above the architecture that won the competition. To achieve this, we bootstrap our NN models through transfer learning by pretraining word embeddings on a secondary task performed on a large pool of unannotated EHRs and using the output embeddings as a foundation of a range of NN architectures. Beyond the official I2B2 challenge, we further achieve 82.4 F1 on extracting relationships between medical terms using attention-based seq2seq models bootstrapped in the same manner.
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Files:
Publisher copy:
10.1016/j.neunet.2019.08.032

Authors


More by this author
Institution:
University of Oxford
Division:
MSD
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MSD
Department:
Psychiatry
Role:
Author
ORCID:
0000-0003-3555-9181
More by this author
Institution:
University of Oxford
Division:
MSD
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MSD
Role:
Author


Publisher:
Elsevier
Journal:
Neural Networks More from this journal
Volume:
121
Pages:
132-139
Publication date:
2019-09-06
Acceptance date:
2019-08-29
DOI:
EISSN:
1879-2782
ISSN:
0893-6080


Language:
English
Keywords:
Pubs id:
1083986
Local pid:
pubs:1083986
Deposit date:
2020-04-13

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP