Cai, Y; Droste, R; Sharma, H; Chatelain, P; Drukker, L; Papageorghiou, AT; Noble, JA
(2020)
Spatio-temporal visual attention modelling of standard biometry plane-finding navigation.
Med Image Anal, 65.
p. 101762.
ISSN 1361-8423
https://doi.org/10.1016/j.media.2020.101762
SGUL Authors: Papageorghiou, Aris
Abstract
We present a novel multi-task neural network called Temporal SonoEyeNet (TSEN) with a primary task to describe the visual navigation process of sonographers by learning to generate visual attention maps of ultrasound images around standard biometry planes of the fetal abdomen, head (trans-ventricular plane) and femur. TSEN has three components: a feature extractor, a temporal attention module (TAM), and an auxiliary video classification module (VCM). A soft dynamic time warping (sDTW) loss function is used to improve visual attention modelling. Variants of the model are trained on a dataset of 280 video clips, each containing one of the three biometry planes and lasting 3-7 seconds, with corresponding real-time recorded gaze tracking data of an experienced sonographer. We report the performances of the different variants of TSEN for visual attention prediction at standard biometry plane detection. The best model performance is achieved using bi-directional convolutional long-short term memory (biCLSTM) in both TAM and VCM, and it outperforms a previous spatial model on all static and dynamic saliency metrics. As an auxiliary task to validate the clinical relevance of the visual attention modelling, the predicted visual attention maps were used to guide standard biometry plane detection in consecutive US video frames. All spatio-temporal TSEN models achieve higher scores compared to a spatial-only baseline; the best performing TSEN model achieves F1 scores on these standard biometry planes of 83.7%, 89.9% and 81.1%, respectively.
Item Type: |
Article
|
Additional Information: |
© 2020 The Authors. Published by Elsevier B.V.
This is an open access article under the CC BY-NC-ND license.
(http://creativecommons.org/licenses/by-nc-nd/4.0/) |
Keywords: |
Fetal ultrasound, Gaze tracking, Multi-task learning, Saliency prediction, Standard plane detection, Biometry, Head, Humans, Neural Networks, Computer, Ultrasonography, Head, Humans, Ultrasonography, Biometry, Neural Networks, Computer, Fetal ultrasound, Gaze tracking, Multi-task learning, Saliency prediction, Standard plane detection, 09 Engineering, 11 Medical and Health Sciences, Nuclear Medicine & Medical Imaging |
SGUL Research Institute / Research Centre: |
Academic Structure > Institute of Medical, Biomedical and Allied Health Education (IMBE) Academic Structure > Institute of Medical, Biomedical and Allied Health Education (IMBE) > Centre for Clinical Education (INMECE ) |
Journal or Publication Title: |
Med Image Anal |
ISSN: |
1361-8423 |
Language: |
eng |
Dates: |
Date | Event |
---|
2 July 2020 | Published | 20 June 2020 | Published Online | 18 June 2020 | Accepted |
|
Publisher License: |
Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0 |
Projects: |
|
PubMed ID: |
32623278 |
Web of Science ID: |
WOS:000567866400008 |
|
Go to PubMed abstract |
URI: |
https://openaccess.sgul.ac.uk/id/eprint/114674 |
Publisher's version: |
https://doi.org/10.1016/j.media.2020.101762 |
Statistics
Item downloaded times since 17 Aug 2022.
Actions (login required)
|
Edit Item |