基于改进U-Net网络的内窥镜图像烟雾净化算法

doi:10.3969/j.issn.0258-8021.2021.03.05

摘要
图/表
参考文献
相关文章 (1)

全文: PDF (10645 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要在微创手术中,电灼、激光烧蚀等操作产生的烟雾严重影响图像质量,遮挡医生视野,增加手术风险,同时也降低计算机辅助手术算法(如分割、三维重建、跟踪等)的性能,因此需要实时去除烟雾,以保持清晰的视野。提出一种基于改进U-Net网络的烟雾净化算法:为了保留更多图像细节,在U-Net网络编码器部分加入经过拉普拉斯金字塔变换的烟雾内窥镜图像;为了提升网络性能,在U-Net网络解码器部分加入CBAM注意力机制模块。以英国汉姆林(Hamlyn)中心提供的腹腔镜图像为原始数据集(训练图像15 000张,合成烟雾测试图像1 000张,真实包含雾气的测试图像129张),采用Blender软件模拟手术过程中烟雾出现的各种情况,对腹腔镜图像加入烟雾,得到合成烟雾图像,再送入模型进行训练,并进行5折交叉验证。在合成数据集上的综合测试结果如下:结构相似性指标SSIM为0.98,峰值性噪比PSNR为31.05。这两项指标说明,经过烟雾净化的图像与原图非常相似,有助于手术中还原人体内部的真实视野。模型平均运行速度为90.91 fps,在浓雾和淡雾数据集上比物理方法和以对抗神经网络为基础的各种方法效果更好。所提出的方法可以在烟雾数据集稀缺的场景为内窥镜烟雾净化算法提供高质量的解决方案,有助于医生得到清晰、开阔的手术视野。

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	林金朝
	蒋媚秋
	庞宇
	王慧倩

关键词 ：内窥镜图像, 烟雾净化, U-Net, 拉普拉斯金字塔, 注意力模型

Abstract：In minimally invasive surgery, the smoke generated by operations such as electrocautery and laser ablation seriously fade image quality, not only obstructs the doctor’s field of view and increases the risk of surgery, but also reduces the performance of computer-assisted surgery algorithms (such as segmentation, 3D reconstruction, tracking, etc.). Therefore, the smoke needs to be removed real time to maintain a clear vision. This paper proposed a desmoking algorithm based on the improved U-Net network. To retain more image details, we add images which undergone Laplace pyramid transformation to the encoder part; and to improve network's performance, we add attention mechanism module (CBAM) to decoder part. The laparoscopic image was provided by the Hamlyn Center and used as the original dataset (15000 training images, 1000 synthetic smoke test images, and 129 real smoke test images), and the Blender software was used to simulate various situations of smoke which was added to the laparoscopic image, and the composite image was obtained and sent to the improved U-Net model for training, and performed 5-fold cross-validation. We obtained a PSNR value of 31.05 and SSIM index of 0.98 on the composed dataset. These two indicators showed that the smoke-purified image was very similar to the original image, which helped restore the real vision of the human body during surgery. The average running time is 90.91 fps, which is applicable in a real-time medical system. The obtained results are better than the other six methods based on physical or based on GAN, therefore the proposed approach provided a high-quality solution for the endoscopic smoke removal algorithm, which helps doctors get a clear surgical field of vision.

Key words： endoscopic image desmoking U-Net Laplace pyramid transformation attention model

收稿日期: 2020-09-02

PACS:

R318

基金资助:国家自然科学基金重点项目(69735101);重庆市教委科学技术研究项目(KJQN201800614)

通讯作者: * E-mail: pangyu@cqupt.edu.cn

引用本文:

林金朝, 蒋媚秋, 庞宇, 王慧倩. 基于改进U-Net网络的内窥镜图像烟雾净化算法[J]. 中国生物医学工程学报, 2021, 40(3): 291-300.
Lin Jinzhao, Jiang Meiqiu, Pang Yu, Wang Huiqian. A Desmoking Algorithm for Endoscopic Images Based on Improved U-Net. Chinese Journal of Biomedical Engineering, 2021, 40(3): 291-300.

链接本文:

http://cjbme.csbme.org/CN/10.3969/j.issn.0258-8021.2021.03.05 或 http://cjbme.csbme.org/CN/Y2021/V40/I3/291

[1] 徐忠,刘洪英,皮喜田. 医用超细内窥镜系统研究 [J]. 中国生物医学工程学报, 2014, 33(1):107-111.
[2] 范姗慧,刘士臣,曹鹗. 无线胶囊内窥镜图像小肠息肉的自动识别 [J]. 中国生物医学工程学报, 2019, 38(5):522-532.
[3] He Kaiming, Sun Jian, Tang Xiaoou et al. Single image haze removal using dark channel prior [J]. IEEE Transactions on Pattern Analysis and machine Intelligence, 2011, 33(12): 2341-2353.
[4] Tchaka K, Pawar VM, Stoyanov D. Chromaticity based smoke removal in endoscopic images [C]//Medical Imaging 2017: Image Processing. 2017:101331M.1-101331M.8.
[5] Ancuti CO, Ancuti C. Single image dehazing by multi-scale fusion [J]. IEEE Transactions on Image Processing, 2013, 22(8): 3271-3282.
[6] Baid A, Kotwal A, Bhalodia R, et al. Joint desmoking, specularity removal, and denoising of laparoscopy images via graphical models and Bayesian inference[C]//2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017). Melbourne: IEEE Press, 2017: 732-736.
[7] Kotwal A, Bhalodia R, Awate SP. Joint desmoking and denoising of laparoscopy images[C] //2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI 2016). Prague: IEEE Press, 2016, 1050-1054.
[8] Luo X, Mcleod AJ, Pautler SE, et al. Vision-based surgical field defogging [J]. IEEE Transactions on Medical Imaging, 2017, 36(10): 2021-2030.
[9] Ancuti CO, Ancuti C. Single image dehazing by multi-scale fusion [J]. IEEE Transactions on Image Processing, 2013, 22(8): 3271-3282.
[10] Cai Bolun, Xu Xiangmin, Jia Kui, et al. Dehazenet: An end-to-end system for single image haze removal [J]. IEEE Transactions on Image Processing, 2016, 25(11): 5187-5198.
[11] Yang Dong, Sun Jian, Proximal. dehaze-net: A prior learning-based deep network for single image dehazing [C] //Proceedings of the European Conference on Computer Vision (ECCV). Munich: IEEE Press, 2018: 702-717.
[12] Salazarcolores S, Albertomoreno H, Flores G, et al. Laparoscopy surgery CO₂ removal via generative adversary network and dark channel prior [EB/OL]. arxiv.org/abs/1909.12314, 2019-09-26/ 2020-09-02.
[13] Chen Long, Tang Wen, John NW. Unsupervised learning of surgical smoke removal from simulation [J]. Computer Assisted Surgery, 2018, 123(10): 3873-3879.
[14] Godard C, Aodha OM, Brostow GJ, et al. Unsupervised monocular depth estimation with left-right consistency[C] //Computer Vision and Pattern Recognition. Hawaii: IEEE Press, 2017: 6602-6611.
[15] Wang Congcong, Mohammed AK, Cheikh FA, et al. Multiscale deep desmoking for laparoscopic surgery[C]// Image Processing. Taipei: IEEE Press, 2019:1117-1126.
[16] Bolkar S, Wang Congcong, Cheikh FA, et al. Deep smoke removal from minimally invasive surgery videos[C] //2018 IEEE International Conference on Image Processing. Athens: IEEE Press, 2018: 3403-3407.
[17] Sidorov O, Wang Congcong, Cheikh FA, et al. Generative smoke removal[J]. Computer Vision and Pattern Recognition, 2019, 116:81-92.
[18] Chen Long, Tang Wen, John NW, et al. De-smokeGCN: Generative cooperative networks for joint surgical smoke detection and removal [J]. IEEE Transactions on Medical Imaging, 2019, 39(5): 1615-1625.
[19] Venkatesh V, Sharma N, Srivastava V, et al. Unsupervised smoke to desmoked laparoscopic surgery images using contrast driven Cyclic-DesmokeGAN [J]. Computers in Biology and Medicine, 2020, 123:1038-1046.
[20] Salazar-Colores, Sebastián H. Desmoking laparoscopy surgery images using an image-to-image translation guided by an embedded dark channel[EB/OL].arxiv.org/abs/2004.08947,2020-04-19/2020-09-20.
[21] Ronneberger O, Fischer P, Brox T, et al. U-Net: Convolutional networks for biomedical image segmentation [EB/OL] //lmb.informatik.uni-freiburg.de/people/ronneber/u-net/,2015/2020-09-02.
[22] Skinner KA, Zhang J, Olson EA, et al. UWStereoNet: Unsupervised learning for depth estimation and color correction of underwater stereo imagery[C]//International Conference on Robotics and Automation. Montreal: IEEE Press, 2019: 7947-7954.
[23] 陈浩,王延杰. 基于拉普拉斯金字塔变换的图像融合算法研究[J]. 激光与红外, 2009, 39(4):439-442.
[24] Woo S, Park J, Lee J, et al. CBAM: Convolutional block attention module[C]//European Conference on Computer Vision. Berlin: Springer-Verlag, 2018: 3-19.
[25] Shin J, Kim M, Paik J, et al. Radiance-reflectance combined optimization and structure-guided l0-norm for single image dehazing [J]. IEEE Transactions on Multimedia, 2019, 22(1):1-16.
[26] Isola P, Zhu Junyan, Zhou Tinghui, et al. Image-to-image translation with conditional adversarial networks[C]//Computer Vision and Pattern Recognition. Hawaii: IEEE Press, 2017: 5967-5976.
[27] Zhuang Liu, Mingjie Sun, Tinghui Zhou et al. Rethinking the value of network pruning [EB/OL]. arxiv.org/abs/1810.05270, 2019-10-11/2020-09-02.