Conference item
Ultrasound image representation learning by modeling sonographer visual attention
- Abstract:
- Image representations are commonly learned from class labels, which are a simplistic approximation of human image understanding. In this paper we demonstrate that transferable representations of images can be learned without manual annotations by modeling human visual attention. The basis of our analyses is a unique gaze tracking dataset of sonographers performing routine clinical fetal anomaly screenings. Models of sonographer visual attention are learned by training a convolutional neural network (CNN) to predict gaze on ultrasound video frames through visual saliency prediction or gaze-point regression. We evaluate the transferability of the learned representations to the task of ultrasound standard plane detection in two contexts. Firstly, we perform transfer learning by fine-tuning the CNN with a limited number of labeled standard plane images. We find that fine-tuning the saliency predictor is superior to training from random initialization, with an average F1-score improvement of 9.6% overall and 15.3% for the cardiac planes. Secondly, we train a simple softmax regression on the feature activations of each CNN layer in order to evaluate the representations independently of transfer learning hyper-parameters. We find that the attention models derive strong representations, approaching the precision of a fully-supervised baseline model for all but the last layer.
- Publication status:
- Published
- Peer review status:
- Reviewed (other)
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 3.3MB, Terms of use)
-
- Publisher copy:
- 10.1007/978-3-030-20351-1_46
Authors
+ Engineering and Physical Sciences Research Council
More from this funder
- Grant:
- EP/R013853/1
- EP/M013774/1
- Publisher:
- Springer
- Host title:
- Information Processing in Medical Imaging
- Journal:
- Information Processing in Medical Imaging More from this journal
- Publication date:
- 2019-05-22
- Acceptance date:
- 2019-02-26
- Event location:
- The Hong Kong University of Science and Technology (HKUST)
- DOI:
- Keywords:
- Pubs id:
-
pubs:980414
- UUID:
-
uuid:a27fe42b-3a94-4b0f-bc7d-2173c0348b6f
- Local pid:
-
pubs:980414
- Source identifiers:
-
980414
- Deposit date:
-
2019-03-07
Terms of use
- Copyright holder:
- Springer Nature
- Copyright date:
- 2019
- Notes:
- © Springer Nature Switzerland AG 2019. This is the accepted manuscript version of the article. The final version is available online from Springer Nature at: https://doi.org/10.1007/978-3-030-20351-1_46
If you are the owner of this record, you can report an update to it here: Report update to this record