Uncertainty-Guided Reliability Enhancement of Residual U-Net CT Segmentation in Medical Cancer Imaging
Abstract
Reliable Computed Tomography (CT) segmentation is a critical requirement for quantitative imaging and computer-aided clinical workflows. Although Residual U-Net (Res-U-Net) architectures achieve strong overlap performance on curated datasets, threshold-based binarization of probability maps often produces scattered false positives, particularly in low-contrast regions and near complex anatomical boundaries. This study analyses the role of predictive uncertainty in improving structural reliability of CT segmentation outputs. Monte-Carlo dropout is employed at inference time to estimate pixel-wise predictive variance, which is combined with mean probability and component size information within a connected-component framework. A component-level scoring rule is evaluated to suppress unstable, low-confidence regions while preserving coherent anatomical structures. Quantitative experiments demonstrate that uncertainty-aware filtering substantially reduces region-level false positives per scan and improves boundary stability, while maintaining competitive Dice and Intersection over Union (IoU) scores. An ablation study further shows that uncertainty penalization is the primary driver of false-positive reduction, and that combining uncertainty with mild size regularization yields the most balanced performance. The results support the use of uncertainty-guided refinement as a practical reliability layer for Res-U-Net–based CT segmentation systems.
Keywords:
Computed tomography, Lung segmentation, Residual U-Net, Uncertainty estimation, Monte-Carlo dropout, False positive removalReferences
- [1] Kalender, W. A. (2011). Computed tomography: Fundamentals, system technology, image quality, applications. John Wiley & Sons. https://api.pageplace.de/preview/DT0400.9783895786440_A24636683/preview-9783895786440_A24636683.pdf
- [2] Team, N. L. S. T. R. (2011). Reduced lung-cancer mortality with low-dose computed tomographic screening. New england journal of medicine, 365(5), 395–409. https://doi.org/10.1056/NEJMoa1102873
- [3] Litjens, G., Kooi, T., Bejnordi, B. E., Setio, A. A. A., Ciompi, F., Ghafoorian, M., ... & Sánchez, C. I. (2017). A survey on deep learning in medical image analysis. Medical image analysis, 42, 60-88. https://doi.org/10.1016/j.media.2017.07.005
- [4] Ronneberger, O., Fischer, P., & Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. International conference on medical image computing and computer-assisted intervention (pp. 234-241). Cham: Springer International Publishing. https://doi.org/10.1007/978-3-319-24574-4_28
- [5] He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770-778). IEEE. https://doi.org/10.1109/CVPR.2016.90
- [6] Isensee, F., Jaeger, P. F., Kohl, S. A. A., Petersen, J., & Maier-Hein, K. H. (2021). nnU-Net: A self-configuring method for deep learning-based biomedical image segmentation. Nature methods, 18(2), 203–211. https://doi.org/10.1038/s41592-020-01008-z
- [7] Çiçek, Ö., Abdulkadir, A., Lienkamp, S. S., Brox, T., & Ronneberger, O. (2016). 3D U-Net: Learning dense volumetric segmentation from sparse annotation. International conference on medical image computing and computer-assisted intervention (pp. 424-432). Cham: Springer International Publishing. https://doi.org/10.1007/978-3-319-46723-8_49
- [8] Alom, M. Z., Hasan, M., Yakopcic, C., Taha, T. M., & Asari, V. K. (2018). Recurrent residual convolutional neural network based on u-net (r2u-net) for medical image segmentation. https://doi.org/10.48550/arXiv.1802.06955
- [9] Lancaster, H. L., Heuvelmans, M. A., & Oudkerk, M. (2022). Low-dose computed tomography lung Cancer screening: Clinical evidence and implementation research. Journal of internal medicine, 292(1), 68–80. https://doi.org/10.1111/joim.13480
- [10] Dice, L. R. (1945). Measures of the amount of ecologic association between species. Ecology, 26(3), 297–302. https://doi.org/10.2307/1932409
- [11] Sorenson, T. (1948). A method of establishing groups of equal amplitude in plant sociology based on similarity of species content and its application to analyses of the vegetation on Danish commons. Biol skar, 5(4), 1-34. https://www.semanticscholar.org/paper/A-method-of-establishing-group-of-equal-amplitude-Sørensen-Sørensen/d8d3e6d95b60ec6ac8f91f42a6914a87b13a6bc1
- [12] Serra, J. (1983). Image analysis and mathematical morphology. Academic Press, Inc. https://books.google.com/books/about/Image_Analysis_and_Mathematical_Morpholo.html?id=BpdTAAAAYAAJ
- [13] Dubuisson, M. P., & Jain, A. K. (1994). A modified Hausdorff distance for object matching. Proceedings of 12th international conference on pattern recognition (Vol. 1, pp. 566-568). IEEE. https://www.semanticscholar.org/paper/A-modified-Hausdorff-distance-for-object-matching-Dubuisson-Jain/3dc1e5fbf7842c214554aac02343cfd1b44ea435
- [14] Guo, Z., Guo, N., Gong, K., Zhong, S., & Li, Q. (2019). Gross tumor volume segmentation for head and neck Cancer radiotherapy using deep dense multi-modality network. Physics in medicine & biology, 64(20), 205015. https://doi.org/10.1088/1361-6560/ab440d
- [15] Gal, Y., & Ghahramani, Z. (2016). Dropout as a bayesian approximation: Representing model uncertainty in deep learning. International conference on machine learning (pp. 1050-1059). PMLR. https://proceedings.mlr.press/v48/gal16.pdf
- [16] Kendall, A., & Gal, Y. (2017). What uncertainties do we need in bayesian deep learning for computer vision? Advances in neural information processing systems (Vol. 30, pp. 5574–5584). Curran Associates, Inc. https://proceedings.neurips.cc/paper/2017/hash/2650d6089a6d640c5e85b2b88265dc2b-Abstract.html
- [17] Kendall, A., Badrinarayanan, V., & Cipolla, R. (2015). Bayesian segnet: Model uncertainty in deep convolutional encoder-decoder architectures for scene understanding. https://doi.org/10.48550/arXiv.1511.02680
- [18] Lakshminarayanan, B., Pritzel, A., & Blundell, C. (2017). Simple and scalable predictive uncertainty estimation using deep ensembles. Advances in neural information processing systems, 30, 1-12. https://proceedings.neurips.cc/paper_files/paper/2017/file/9ef2ed4b7fd2c810847ffa5fa85bce38-Paper.pdf
- [19] Kohl, S., Romera-Paredes, B., Meyer, C., De Fauw, J., Ledsam, J. R., Maier-Hein, K., ... & Ronneberger, O. (2018). A probabilistic u-net for segmentation of ambiguous images. Advances in neural information processing systems (Vol. 31, pp. 6965–6975). Curran Associates, Inc. https://proceedings.neurips.cc/paper_files/paper/2018/file/473447ac58e1cd7e96172575f48dca3b-Paper.pdf
- [20] Khanna, A., Londhe, N. D., Gupta, S., & Semwal, A. (2020). A deep Residual U-Net convolutional neural network for automated lung segmentation in computed tomography images. Biocybernetics and biomedical engineering, 40(3), 1314–1327. https://doi.org/10.1016/j.bbe.2020.07.007
- [21] Miok, K., Nguyen-Doan, D., Zaharie, D., & Robnik-Šikonja, M. (2019). Generating data using Monte Carlo dropout. 2019 IEEE 15th international conference on intelligent computer communication and processing (ICCP) (pp. 509-515). IEEE. https://doi.org/10.1109/ICCP48234.2019.8959787