UTMap: Triplet Neural Network for Uncertainty Medical Image Analysis
Abstract
This paper proposes a framework to enhance the reliability of deep classifiers in medical imaging by combining visualization and uncertainty quantification through meta-learning. The methodology employs a Triplet Neural Network (UTMap) to project instances into the Instance Uncertainty Space (IUS), which highlights patterns of confidence and uncertainty. Additionally, the Neighborhood Reliability Score (NRS) metric is introduced to estimate uncertainty based on the spatial relationships within the IUS. Experimental results show that the IUS effectively represents classifier behavior and that the NRS achieves competitive performance compared to traditional uncertainty estimation algorithms in distinguishing between correct and incorrect predictions.References
Ahmed et al. (2023). A systematic review of the barriers to the implementation of artificial intelligence in healthcare. Cureus, 15(10).
Al-Dhabyani, W., Gomaa, et al. (2020). Dataset of breast ultrasound images. Data in brief, 28:104863.
Algan, G. and Ulusoy, I. (2021). Image classification with deep learning in the presence of noisy labels: A survey. Knowledge-Based Systems, 215:106771.
Bajwa, J., Munir, et al. (2021). Artificial intelligence in healthcare: transforming the practice of medicine. Future healthcare journal, 8(2):e188–e194.
Benjamin Lambert, F. F. et al. (2024). Trustworthy clinical ai solutions: A unified review of uncertainty quantification in deep learning models for medical image analysis. Artificial Intelligence in Medicine, 150:102830.
Gal, Y. and Ghahramani, Z. (2016). Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In international conference on machine learning, pages 1050–1059. PMLR.
Gayathri, J., Abraham, B., Sujarani, M., and Nair, M. S. (2022). A computer-aided diagnosis system for the classification of covid-19 and non-covid-19 pneumonia on chest x-ray images by integrating cnn with sparse autoencoder and feed forward neural network. Computers in biology and medicine, 141:105134.
Goceri, E. (2023). Medical image data augmentation: techniques, comparisons and interpretations. Artificial Intelligence Review, 56(11):12561–12605.
Goodfellow, I. J., Shlens, J., and Szegedy, C. (2015). Explaining and harnessing adversarial examples.
Hassan, M., Kushniruk, A., and Borycki, E. (2024). Barriers to and facilitators of artificial intelligence adoption in health care: scoping review. JMIR Human Factors, 11:e48633.
He, K. et al. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778.
He, X. et al. (2018). Triplet-center loss for multi-view 3d object retrieval. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1945–1954.
Hendrycks, D. and Gimpel, K. (2016). A baseline for detecting misclassified and out-of-distribution examples in neural networks. arXiv preprint arXiv:1610.02136.
Hoffer, E. and Ailon, N. (2015). Deep metric learning using triplet network. In Similarity-based pattern recognition: third international workshop, SIMBAD 2015, Copenhagen, Denmark, October 12-14, 2015. Proceedings 3, pages 84–92. Springer.
Hoffman, J. et al. (2025). Overcoming barriers and enabling artificial intelligence adoption in allied health clinical practice: A qualitative study. Digital Health, 11:20552076241311144.
Huisman, M., Van Rijn, J. N., and Plaat, A. (2021). A survey of deep meta-learning. Artificial Intelligence Review, 54(6):4483–4541.
Ioffe, S. and Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International conference on machine learning, pages 448–456. pmlr.
Islam, T. et al. (2024). A systematic review of deep learning data augmentation in medical imaging: Recent advances and future research directions. Healthcare Analytics, 5:100340.
Kaya, M. and Bilge, H. Ş. (2019). Deep metric learning: A survey. Symmetry, 11(9):1066.
Kingma, D. P. and Ba, J. (2015). Adam: A method for stochastic optimization. international conference on learning representations (2015). San Diego, California.
Koch et al. (2015). Siamese neural networks for one-shot image recognition. In ICML deep learning workshop, volume 2, pages 1–30. Lille.
Lakshminarayanan, B., Pritzel, A., and Blundell, C. (2017). Simple and scalable predictive uncertainty estimation using deep ensembles. In Guyon, I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., and Garnett, R., editors, Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc.
LeCun, Y., Bengio, Y., and Hinton, G. (2015). Deep learning. nature, 521(7553):436–444.
Lee, J.-G. et al. (2017). Deep learning in medical imaging: general overview. Korean journal of radiology, 18(4):570.
Ling Huang, S. R. et al. (2024). A review of uncertainty quantification in medical image analysis: Probabilistic and non-probabilistic methods. Medical Image Analysis, 97:103223.
Mei, X. et al. (2020). Artificial intelligence–enabled rapid diagnosis of patients with covid-19. Nature medicine, 26(8):1224–1228.
Nair, V. and Hinton, G. E. (2010). Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th international conference on machine learning (ICML-10), pages 807–814.
Nguyen, A., Yosinski, J., and Clune, J. (2015). Deep neural networks are easily fooled: High confidence predictions for unrecognizable images. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 427–436.
Poon, E. G. et al. (2025). Adoption of artificial intelligence in healthcare: survey of health system priorities, successes, and challenges. Journal of the American Medical Informatics Association, 32(7):1093–1100.
Russakovsky, O., Deng, et al. (2015). Imagenet large scale visual recognition challenge. International journal of computer vision, 115:211–252.
Sartaj Bhuvaji, A. K. o. (2020). Brain tumor classification (mri).
Schroff, F., Kalenichenko, D., and Philbin, J. (2015). Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 815–823.
Shorten, C. and Khoshgoftaar, T. M. (2019). A survey on image data augmentation for deep learning. Journal of big data, 6(1):1–48.
Simo-Serra, E. et al. (2015). Discriminative learning of deep convolutional feature point descriptors. In Proceedings of the IEEE International Conference on Computer Vision (ICCV).
Smith, M. R., Martinez, T., and Giraud-Carrier, C. (2014). An instance level analysis of data complexity. Machine learning, 95:225–256.
Snell, J., Swersky, K., and Zemel, R. (2017). Prototypical networks for few-shot learning. Advances in neural information processing systems, 30.
Soares et al. Sars-cov-2 ct-scan dataset: A large dataset of real patients ct scans for sars-cov-2 identification. medrxiv 2020. Google Scholar.
Tian, Y., Zhao, X., and Huang, W. (2022). Meta-learning approaches for learning-to-learn in deep learning: A survey. Neurocomputing, 494:203–223.
Vinyals et al. (2016). Matching networks for one shot learning. Advances in neural information processing systems, 29.
Wen, Y. et al. (2016). A discriminative feature learning approach for deep face recognition. In Computer vision–ECCV 2016: 14th European conference, amsterdam, the netherlands, October 11–14, 2016, proceedings, part VII 14, pages 499–515. Springer.
Yosinski, J., Clune, J., Bengio, Y., and Lipson, H. (2014). How transferable are features in deep neural networks? Zeiler, M. D. and Fergus, R. (2013). Visualizing and understanding convolutional networks.
Al-Dhabyani, W., Gomaa, et al. (2020). Dataset of breast ultrasound images. Data in brief, 28:104863.
Algan, G. and Ulusoy, I. (2021). Image classification with deep learning in the presence of noisy labels: A survey. Knowledge-Based Systems, 215:106771.
Bajwa, J., Munir, et al. (2021). Artificial intelligence in healthcare: transforming the practice of medicine. Future healthcare journal, 8(2):e188–e194.
Benjamin Lambert, F. F. et al. (2024). Trustworthy clinical ai solutions: A unified review of uncertainty quantification in deep learning models for medical image analysis. Artificial Intelligence in Medicine, 150:102830.
Gal, Y. and Ghahramani, Z. (2016). Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In international conference on machine learning, pages 1050–1059. PMLR.
Gayathri, J., Abraham, B., Sujarani, M., and Nair, M. S. (2022). A computer-aided diagnosis system for the classification of covid-19 and non-covid-19 pneumonia on chest x-ray images by integrating cnn with sparse autoencoder and feed forward neural network. Computers in biology and medicine, 141:105134.
Goceri, E. (2023). Medical image data augmentation: techniques, comparisons and interpretations. Artificial Intelligence Review, 56(11):12561–12605.
Goodfellow, I. J., Shlens, J., and Szegedy, C. (2015). Explaining and harnessing adversarial examples.
Hassan, M., Kushniruk, A., and Borycki, E. (2024). Barriers to and facilitators of artificial intelligence adoption in health care: scoping review. JMIR Human Factors, 11:e48633.
He, K. et al. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778.
He, X. et al. (2018). Triplet-center loss for multi-view 3d object retrieval. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1945–1954.
Hendrycks, D. and Gimpel, K. (2016). A baseline for detecting misclassified and out-of-distribution examples in neural networks. arXiv preprint arXiv:1610.02136.
Hoffer, E. and Ailon, N. (2015). Deep metric learning using triplet network. In Similarity-based pattern recognition: third international workshop, SIMBAD 2015, Copenhagen, Denmark, October 12-14, 2015. Proceedings 3, pages 84–92. Springer.
Hoffman, J. et al. (2025). Overcoming barriers and enabling artificial intelligence adoption in allied health clinical practice: A qualitative study. Digital Health, 11:20552076241311144.
Huisman, M., Van Rijn, J. N., and Plaat, A. (2021). A survey of deep meta-learning. Artificial Intelligence Review, 54(6):4483–4541.
Ioffe, S. and Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International conference on machine learning, pages 448–456. pmlr.
Islam, T. et al. (2024). A systematic review of deep learning data augmentation in medical imaging: Recent advances and future research directions. Healthcare Analytics, 5:100340.
Kaya, M. and Bilge, H. Ş. (2019). Deep metric learning: A survey. Symmetry, 11(9):1066.
Kingma, D. P. and Ba, J. (2015). Adam: A method for stochastic optimization. international conference on learning representations (2015). San Diego, California.
Koch et al. (2015). Siamese neural networks for one-shot image recognition. In ICML deep learning workshop, volume 2, pages 1–30. Lille.
Lakshminarayanan, B., Pritzel, A., and Blundell, C. (2017). Simple and scalable predictive uncertainty estimation using deep ensembles. In Guyon, I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., and Garnett, R., editors, Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc.
LeCun, Y., Bengio, Y., and Hinton, G. (2015). Deep learning. nature, 521(7553):436–444.
Lee, J.-G. et al. (2017). Deep learning in medical imaging: general overview. Korean journal of radiology, 18(4):570.
Ling Huang, S. R. et al. (2024). A review of uncertainty quantification in medical image analysis: Probabilistic and non-probabilistic methods. Medical Image Analysis, 97:103223.
Mei, X. et al. (2020). Artificial intelligence–enabled rapid diagnosis of patients with covid-19. Nature medicine, 26(8):1224–1228.
Nair, V. and Hinton, G. E. (2010). Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th international conference on machine learning (ICML-10), pages 807–814.
Nguyen, A., Yosinski, J., and Clune, J. (2015). Deep neural networks are easily fooled: High confidence predictions for unrecognizable images. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 427–436.
Poon, E. G. et al. (2025). Adoption of artificial intelligence in healthcare: survey of health system priorities, successes, and challenges. Journal of the American Medical Informatics Association, 32(7):1093–1100.
Russakovsky, O., Deng, et al. (2015). Imagenet large scale visual recognition challenge. International journal of computer vision, 115:211–252.
Sartaj Bhuvaji, A. K. o. (2020). Brain tumor classification (mri).
Schroff, F., Kalenichenko, D., and Philbin, J. (2015). Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 815–823.
Shorten, C. and Khoshgoftaar, T. M. (2019). A survey on image data augmentation for deep learning. Journal of big data, 6(1):1–48.
Simo-Serra, E. et al. (2015). Discriminative learning of deep convolutional feature point descriptors. In Proceedings of the IEEE International Conference on Computer Vision (ICCV).
Smith, M. R., Martinez, T., and Giraud-Carrier, C. (2014). An instance level analysis of data complexity. Machine learning, 95:225–256.
Snell, J., Swersky, K., and Zemel, R. (2017). Prototypical networks for few-shot learning. Advances in neural information processing systems, 30.
Soares et al. Sars-cov-2 ct-scan dataset: A large dataset of real patients ct scans for sars-cov-2 identification. medrxiv 2020. Google Scholar.
Tian, Y., Zhao, X., and Huang, W. (2022). Meta-learning approaches for learning-to-learn in deep learning: A survey. Neurocomputing, 494:203–223.
Vinyals et al. (2016). Matching networks for one shot learning. Advances in neural information processing systems, 29.
Wen, Y. et al. (2016). A discriminative feature learning approach for deep face recognition. In Computer vision–ECCV 2016: 14th European conference, amsterdam, the netherlands, October 11–14, 2016, proceedings, part VII 14, pages 499–515. Springer.
Yosinski, J., Clune, J., Bengio, Y., and Lipson, H. (2014). How transferable are features in deep neural networks? Zeiler, M. D. and Fergus, R. (2013). Visualizing and understanding convolutional networks.
Published
2025-09-29
How to Cite
SOUZA E SILVA, Rafael; CARVALHO, André C. P. L. F. de.
UTMap: Triplet Neural Network for Uncertainty Medical Image Analysis. In: NATIONAL MEETING ON ARTIFICIAL AND COMPUTATIONAL INTELLIGENCE (ENIAC), 22. , 2025, Fortaleza/CE.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2025
.
p. 1104-1115.
ISSN 2763-9061.
DOI: https://doi.org/10.5753/eniac.2025.14372.
