关键点引导的时序网络用于超声心动图分割

doi:10.3969/j.issn.0258-8021.2024.06.001

摘要
图/表
参考文献
相关文章 (15)

全文: PDF (7604 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要超声心动图分割是先天性心脏病筛查中的一个重要步骤。然而,超声心动图图像质量较低,并且因心脏跳动,超声心动视频部分帧中的一些心脏关键结构会模糊或消失。对于结构消失的目标帧,通常需要依靠超声心动视频中其他结构清晰的帧来推导确定目标帧中关键结构的位置。在此基础上,本研究设计了一个关键点引导的时序网络来完成超声心动图的分割。具体来说,对于要分割的目标帧,使用超声视频中的其他帧作为辅助帧。首先设计了一个双向时序网络,通过关键点引导网络从辅助帧中提取关键结构信息。然后提出了一种Transformer时间注意力模块,调整各辅助帧的特征权重,关注结构清晰的辅助帧。此外,提出了图像映射模块,将辅助帧的结构信息直接映射到目标帧,完成了对目标帧中缺失结构信息的补充。在98例胸骨旁短轴切面数据上进行了实验,平均Dice达到0.826 9。实验证明所提出的方法能够有效应用于超声心动图分割中。

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	向卓
	陈伟玲
	田晓雨
	赵程
	汪天富
	雷柏英

关键词 ：超声心动图分割, 关键点定位, Transformer时间注意力, 双向时序网络, 图像映射

Abstract：Echocardiographic segmentation is an important step in the screening of congenital heart disease. However, the quality of echocardiogram image usually is relatively low, and some key heart structures in the echocardiogram video portion of the frame can blur or disappear due to the beating of the heart. For the target frame whose structure disappears, it is usually necessary to deduce the position of the key structure in the target frame by relying on other frames with clear structure in the echocardiography video. Aiming to address these challenges, this study designed a key point guided timing network to complete the segmentation of echocardiography. Specifically, for the target frame to be segmented, other frames in the ultrasonic video were used as secondary frames. First, a bidirectional temporal network (BTN) was designed to extract the structure information from the auxiliary frame, and in this process, the key points guided the network to extract the key structure information. Then, a transformer temporal attention (TTA) module was proposed to adjust the feature weights of each auxiliary frame and focus on the auxiliary frame with clear structure. In addition, this study proposed an image mapping (IM) module, which mapped the structure information of the auxiliary frame directly to the target frame and completed the supplement of missing structure information in the target frame. In this study, experiments were conducted on the parasternal short axis section data of 98 cases, and the average Dice reached 0.8269. Experimental results showed that the proposed method could be effectively applied to echocardiogram segmentation.

Key words： echocardiography segmentation key point prediction transformer temporal attention bidirectional temporal net-work image mapping

收稿日期: 2024-01-03

PACS:

R318

基金资助:广东省自然科学基金(2022A1515110704);中国博士后科学基金(2023M732358);国家自然科学基金(62301329)

通讯作者: ^*E-mail:leiby@szu.edu.cn

引用本文:

向卓, 陈伟玲, 田晓雨, 赵程, 汪天富, 雷柏英. 关键点引导的时序网络用于超声心动图分割[J]. 中国生物医学工程学报, 2024, 43(6): 641-651.
Xiang Zhuo, Chen Weiling, Tian Xiaoyu, Zhao Cheng, Wang Tianfu, Lei Baiying. Key Point-Guided Temporal Network Used for Segmentation of Echocardiography. Chinese Journal of Biomedical Engineering, 2024, 43(6): 641-651.

链接本文:

http://cjbme.csbme.org/CN/10.3969/j.issn.0258-8021.2024.06.001 或 http://cjbme.csbme.org/CN/Y2024/V43/I6/641

[1] Zimmerman MS, Smith AGC, Sable CA, et al. Global, regional, and national burden of congenital heart disease, 1990–2017: a systematic Analysis for the Global Burden of Disease Study 2017[J]. The Lancet Child & Adolescent Health, 2020, 4:185-200.
[2] Avendi MR, Kheradvar A, Jafarkhani H. A combined deep-learning and deformable-model approach to fully automatic segmentation of the left ventricle in cardiac mri[J]. Medical Image Analysis, 2016, 30:108-119.
[3] Ngo TA, Lu Z, Carneiro G Combining deep learning and level set for the automated segmentation of the left ventricle of the heart from cardiac cine magnetic resonance[J]. Medical Image Analysis, 2017, 35:159-171.
[4] Guo Libao, Lei Baiying, Chen Weiling, et al. Dual attention enhancement feature fusion network for segmentation and quantitative analysis of paediatric echocardiography[J]. Medical Image Analysis, 2021, 71:102042.
[5] Li Kai, Wang Shujun, Yu Lequan, et al. Dual-teacher++: exploiting intra-domain and inter-domain knowledge with reliable transfer for cardiac segmentation[J]. IEEE Transactions on Medical Imaging, 2020, 40(10): 2771-2782.
[6] Zhao Cheng, Chen Weiling, Qin Jin, et al. IFT-net: interactive fusion transformer network for quantitative analysis of pediatric echocardiography[J]. Medical Image Analysis, 2022, 82:102648.
[7] Leclerc S, Smistad E, Østvik A, et al. Lu-net: a multistage attention network to improve the robustness of segmentation of left ventricular structures in 2-D echocardiography[J]. IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control, 2020, 67:2519-2530.
[8] Hu Yujin, Xia Bei, Mao Muyi, et al. Aidan: an attention-guided dual-path network for pediatric echocardiography segmentation[J]. IEEE Access, 2020, 8:29176-29187.
[9] Yu Changqian, Wang Jjingbo, Chao Peng, et al. Bisenet: bilateral segmentation network for real-time semantic segmentation[C] // Proceedings of the European Conference on Computer Vision (ECCV). Munich: ECCV, 2018: 325-341.
[10] Valanarasu JMJ, Oza P, Hacihaliloglu I, et al. Medical transformer: gated axial-attention for medical image segmentation[C] // Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th International Conference. Strasbourg: MICCAI, 2021: 36-46.
[11] 吴宣言, 缑新科, 朱子重,等. 深层聚合残差密集网络的超声图像左心室分割[J]. 中国图象图形学报, 2020, 25(9):1930-1942.
[12] 葛帅, 严加勇, 谢利剑,等. 改进型 U-Net 网络的左心室超声心动图像分割[J]. 软件导刊, 2021, 20(2): 206-209.
[13] 胡玉进, 雷柏英, 郭力宝,等. 基于 bisenet 的小儿超声心动图左心分割方法[J]. 中国生物医学工程学报, 2019, 38(5): 533-539.
[14] 俊庞, 永雄王, 丽君陈,等. 面向儿科超声心动图双侧心室分割的注意力引导网络[J]. 生物医学工程学杂志, 2023, 40(5):928-937.
[15] Ke Sun, Xiao Bin, Liu Dong, et al. Deep high-resolution representation learning for human pose estimation[C] // 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach: CVPR, 2019: 5686-5696.
[16] Cao Zhe, Simon T, Wei SE, et al. Realtime multi-person 2d pose estimation using part affinity fields[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: CVPR, 2017: 7291-7299.
[17] Payer C, Štern D, Bischof H, et al. Regressing heatmaps for multiple landmark localization using cnns[C]// Medical Image Computing and Computer-Assisted Intervention–MICCAI 2016: 19th International Conference. Athens: MICCAI, 2016: 230-238.
[18] Ta KMT. Multi-task learning for cardiac motion analysis and segmentation in echocardiography [D]. New Haven: Yale University, 2023.
[19] Dozen A, Komatsu M, Sakai A, et al. Image segmentation of the ventricular septum in fetal cardiac ultrasound videos based on deep learning using time-series information[J]. Biomolecules, 2020, 10(11): 1526.
[20] Chen Zejian, Wei Zhuo, Wang Tianfu, et al. Semi-supervised representation learning for segmentation on medical volumes and sequences[J]. IEEE Transactions on Medical Imaging, 42(12): 3972-3986, 2023.
[21] Cho K, Van Merriënboer B, Gulcehre C, et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation[EB/OL]. https://arxiv.org/abs/1406.1078, 2014-09-03/2024-01-03.
[22] Wang Xinyao, Bo Liefeng, Li Fuxin. Adaptive wing loss for robust face alignment via heatmap regression[C]// Proceedings of the IEEE/CVF International Conference on Computer Vision. Seoul: ICCV, 2019: 6971-6981.
[23] Hu Jie, Shen Li, Sun Gang. Squeeze-and-excitation networks[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City: CVPR, 2018: 7132-7141.
[24] Wang Qilong, Wu Banggu, Zhu Pengfei, et al. Eca-net: efficient channel attention for deep convolutional neural networks[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle: CVPR, 2020: 11534-11542.
[25] Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[J]. Advances in Neural Information Processing Systems, 2017, 30: 6000-6010.
[26] Wang Panqu, Chen Pengfei, Yuan Ye, et al. Understanding convolution for semantic segmentation[C]// 2018 IEEE Winter Conference on Applications of Computer Vision (WACV). Lake Tahoe: WACV, 2018: 1451-1460.
[27] Chen LC, Zhu Yukun, Papandreou G, et al. Encoder-decoder with atrous separable convolution for semantic image segmentation[C]// Proceedings of the European Conference on Computer Vision (ECCV). Munich: ECCV, 2018: 801-818.
[28] Zhao Hengshuang, Shi Jianping, Qi Xiaojuan, et al. Pyramid scene parsing network[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: CVPR, 2017: 2881-2890.
[29] Fu Jun, Liu Jing, Tian Haijie, et al. Dual attention network for scene segmentation[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach: CVPR, 2019: 3146-3154.
[30] Badrinarayanan V, Kendall A, Cipolla R Segnet: a deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39:2481-2495.
[31] Dong Shunjie, Zhao Jinlong, Zhang Maojun, et al. Deu-net: Ddeformable U-Net for 3D cardiac mri video segmentation[C]// Medical Image Computing and Computer Assisted Intervention–MICCAI 2020: 23rd International Conference. Lima: MICCAI, 2020: 98-107.
[32] Zheng Qiao, Delingette H, Duchateau N, et al. 3-D consistent and robust segmentation of cardiac images by deep learning with spatial propagation[J]. IEEE Transactions on Medical Imaging, 2018, 37:2137-2148.
[33] 徐佳陈, 肖志勇. 心脏动态 MRI 图像分割的时空多尺度网络[J]. 中国图象图形学报, 2022, 27(3):862-872.
[34] 尹慧平, 张耀楠, 何颖. 基于 CT 心脏图像的腔体区域分割新算法[J]. 计算机系统应用, 2017:26(11):292-295.
[35] 刘畅, 林楠, 曹仰杰,等. Seg-capnet: 心脏 MRI 图像分割神经网络模型[J]. 中国图象图形学报, 2021, 26(2): 452-463.
[36] Liang Jiajun, Pan Huijuan, Xiang Zhuo, et al. Echocardiographic segmentation based on semi-supervised deep learning with attention mechanism[J]. Multimedia Tools and Applications, 2024, 83(12): 36953-36973.