Segmentation is better when shared: a review of public H&E histological images datasets

  • Anna Clara Medina Roissmann UNIFESP
  • Thaína A. Azevedo Tosta UNIFESP

Resumo


The evaluation of histological images is a key step in cancer diagnosis, but it is a time-consuming and subjective process. To overcome these challenges, computer-aided diagnosis systems have emerged to offer a faster and more accurate analysis. Among the steps of these systems, image segmentation plays a crucial role by isolating regions of interest for further examination. In this context, this systematic review investigates the use of publicly available datasets in histological image segmentation analysis using Hematoxylin-Eosin (H&E) staining. The review addresses 15 guiding questions, covering various aspects, including the most common segmentation techniques, evaluation metrics, and existing limitations in the literature.

Referências

X. Pan, L. Li, H. Yang, Z. Liu, J. Yang, L. Zhao, and Y. Fan, “Accurate segmentation of nuclei in pathological images via sparse reconstruction and deep convolutional networks,” Neurocomputing, vol. 229, pp. 88–99, 2017.

S. Vishnoi, A. K. Jain, and P. K. Sharma, “A nuclei segmentation method based on whale optimization algorithm fuzzy clustering in histopathological images,” in 2019 4th International Conference on Information Systems and Computer Networks (ISCON). IEEE, 2019, pp. 728–732.

M. R. Prusty, R. Dinesh, H. S. Kumar Sheth, A. L. Viswanath, and S. K. Satapathy, “Nuclei segmentation in histopathology images using structure-preserving color normalization based ensemble deep learning frameworks.” Computers, Materials & Continua, vol. 77, no. 3, 2023.

A. Mahbod, G. Schaefer, B. Bancher, C. Löw, G. Dorffner, R. Ecker, and I. Ellinger, “Cryonuseg: A dataset for nuclei instance segmentation of cryosectioned h&e-stained histological images,” Computers in biology and medicine, vol. 132, p. 104349, 2021.

M. Dabass, R. Vig, and S. Vashisth, “Review of histopathological image segmentation via current deep learning approaches,” in 2018 4th International Conference on Computing Communication and Automation (ICCCA). IEEE, 2018, pp. 1–6.

R. Krithiga and P. Geetha, “Breast cancer detection, segmentation and classification on histopathology images analysis: a systematic review,” Archives of Computational Methods in Engineering, vol. 28, no. 4, pp. 2607–2619, 2021.

W. R. Drioua, N. Benamrane, and L. Sais, “Breast cancer detection from histopathology images based on yolov5,” in 2022 7th International Conference on Frontiers of Signal Processing (ICFSP). IEEE, 2022, pp. 30–34.

I. Ahmad, Y. Xia, H. Cui, and Z. U. Islam, “Dan-nucnet: A dual attention based framework for nuclei segmentation in cancer histology images under wild clinical conditions,” Expert Systems with Applications, vol. 213, p. 118945, 2023.

X. Zhang, X. Zhu, K. Tang, Y. Zhao, Z. Lu, and Q. Feng, “Ddtnet: A dense dual-task network for tumor-infiltrating lymphocyte detection and segmentation in histopathological images of breast cancer,” Medical image analysis, vol. 78, p. 102415, 2022.

J.-M. Bokhorst, I. D. Nagtegaal, F. Fraggetta, S. Vatrano, W. Mesker, M. Vieth, J. van der Laak, and F. Ciompi, “Deep learning for multiclass semantic segmentation enables colorectal cancer detection and classification in digital pathology images,” Scientific Reports, vol. 13, no. 1, p. 8398, 2023.

M. Gour, S. Jain, and R. Agrawal, “Deeprnnetseg: deep residual neural network for nuclei segmentation on breast cancer histopathological images,” in International Conference on Computer Vision and Image Processing. Springer, 2019, pp. 243–253.

I. Kiran, B. Raza, A. Ijaz, and M. A. Khan, “Denseres-unet: Segmentation of overlapped/clustered nuclei from multi organ histopathology images,” Computers in Biology and Medicine, vol. 143, p. 105267, 2022.

A. Arab, V. Garcia, S. Guan, B. D. Gallas, B. Sahiner, N. Petrick, and W. Chen, “Effect of color-normalization on deep learning segmentation models for tumor-infiltrating lymphocytes scoring using breast cancer histopathology images,” in Medical Imaging 2023: Digital and Computational Pathology, vol. 12471. SPIE, 2023, pp. 390–394.

M. Dawood, K. Branson, N. M. Rajpoot, and F. Minhas, “Albrt: Cellular composition prediction in routine histology images,” in Proceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 664–673.

B. Wu, T. Kausar, Q. Xiao, M. Wang, W. Wang, B. Fan, and D. Sun, “Ff-cnn: An efficient deep neural network for mitosis detection in breast cancer histological images,” in Medical Image Understanding and Analysis: 21st Annual Conference, MIUA 2017, Edinburgh, UK, July 11–13, 2017, Proceedings 21. Springer, 2017, pp. 249–260.

Z. Yildirim, R. Samet, E. Hancer, N. Nemati, and M. T. Mali, “Gland segmentation in h&e histopathological images using u-net with attention module,” in 2023 Twelfth International Conference on Image Processing Theory, Tools and Applications (IPTA). IEEE, 2023, pp. 1–6.

L. S. Nair, R. Prabhu, G. Sugathan, K. V. Gireesh, and A. S. Nair, “Mitotic nuclei detection in breast histopathology images using yolov4,” in 2021 12th International Conference on Computing Communication and Networking Technologies (ICCCNT). IEEE, 2021, pp. 1–5.

J. Li and X. Li, “Miu-net: Mix-attention and inception u-net for histopathology image nuclei segmentation,” Applied Sciences, vol. 13, no. 8, p. 4842, 2023.

T. Kausar, M. Wang, B. Wu, M. Idrees, and B. Kanwal, “Multi-scale deep neural network for mitosis detection in histological images,” in 2018 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS), vol. 3. IEEE, 2018, pp. 47–51.

Y. Zhou, H. Chang, K. E. Barner, and B. Parvin, “Nuclei segmentation via sparsity constrained convolutional regression,” in 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI). IEEE, 2015, pp. 1284–1287.

S. Lal, D. Das, K. Alabhya, A. Kanfade, A. Kumar, and J. Kini, “Nucleisegnet: Robust deep learning architecture for the nuclei segmentation of liver cancer histopathology images,” Computers in Biology and Medicine, vol. 128, p. 104075, 2021.

J. K. L. Domoguen, J. J. P. Suarez, and P. C. Naval, “Tag: Nucleus detection in colorectal adenocarcinomas histology images using local texture, appearance, and gradient features,” in 2019 3rd International Conference on Imaging, Signal Processing and Communication (ICISPC). IEEE, 2019, pp. 155–159.

L.-W. Liu, Z. Huang, K.-Y. Lu, Z.-X. Wang, Y.-M. Liang, S.-Y. Lin, and Y.-H. Ji, “Ujat-net: A u-net combined joint-attention and transformer for breast tubule segmentation in h&e stained images,” IEEE Access, 2024.

H. M. AlGhamdiă, N. A. Koohbanani, N. Rajpoot, and S. E. A. Raza, “A novel cell map representation for weakly supervised prediction of er & pr status from h&e wsis,” in MICCAI Workshop on Computational Pathology. PMLR, 2021, pp. 10–19.

D. F. Dos Santos, P. R. de Faria, B. A. Travençolo, and M. Z. do Nascimento, “Influence of data augmentation strategies on the segmentation of oral histological images using fully convolutional neural networks,” Journal of Digital Imaging, vol. 36, no. 4, pp. 1608–1623, 2023.

K. Zhang, J. Fu, L. Hua, P. Zhang, Y. Shao, S. Xu, H. Zhou, L. Chen, and J. Wang, “Multiple morphological constraints-based complex gland segmentation in colorectal cancer pathology image analysis,” Complexity, vol. 2020, no. 1, p. 6180457, 2020.

N. Altini, A. Brunetti, E. Puro, M. G. Taccogna, C. Saponaro, F. A. Zito, S. De Summa, and V. Bevilacqua, “Ndg-cam: Nuclei detection in histopathology images with semantic segmentation networks and grad-cam,” Bioengineering, vol. 9, no. 9, p. 475, 2022.

J. Fu, K. Zhang, and P. Zhang, “Poorly differentiated colorectal gland segmentation approach based on internal and external stress in histology images,” in 2020 5th International Conference on Computer and Communication Systems (ICCCS). IEEE, 2020, pp. 338–342.

M. Abdolhoseini, M. G. Kluge, F. R. Walker, and S. J. Johnson, “Segmentation of heavily clustered nuclei from histopathological images,” Scientific reports, vol. 9, no. 1, p. 4551, 2019.

T. Kausar, M. Wang, M. A. Ashraf, and A. Kausar, “Smallmitosis: Small size mitotic cells detection in breast histopathology images,” IEEE Access, vol. 9, pp. 905–922, 2020.

H. Amitha, I. Selvamani, and D. A. S. Dhas, “Developement of computer aided system for detection and classification of mitosis using svm,” in 2017 International Conference on Inventive Computing and Informatics (ICICI). IEEE, 2017, pp. 954–958.

S. Graham, M. Jahanifar, A. Azam, M. Nimir, Y.-W. Tsang, K. Dodd, E. Hero, H. Sahota, A. Tank, K. Benes et al., “Lizard: a large-scale dataset for colonic nuclear instance segmentation and classification,” in Proceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 684–693.

C. Fernandez-Martín, J. Silva-Rodriguez, U. Kiraz, S. Morales, E. A. Janssen, and V. Naranjo, “Uninformed teacher-student for hard-samples distillation in weakly supervised mitosis localization,” Computerized Medical Imaging and Graphics, vol. 112, p. 102328, 2024.

O. C. Linares, A. A. Soriano-Vargas, B. S. Faiçal, B. Hamann, A. T. Fabro, and A. J. Traina, “Efficient segmentation of cell nuclei in histopathological images,” in 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems (CBMS). IEEE, 2020, pp. 47–52.

A. K. Chanchal, S. Lal, and J. Kini, “Deep structured residual encoder-decoder network with a novel loss function for nuclei segmentation of kidney and breast histopathology images,” Multimedia Tools and Applications, vol. 81, no. 7, pp. 9201–9224, 2022.

R. KC Khatri, B. J Caseria, Y. Lou, G. Xiao, and Y. Cao, “Automatic extraction of cell nuclei using dilated convolutional network,” Inverse Problems & Imaging, vol. 15, no. 1, 2021.

L. Putzu and G. Fumera, “An empirical evaluation of nuclei segmentation from h&e images in a real application scenario,” Applied Sciences, vol. 10, no. 22, p. 7982, 2020.

P. He, A. Qu, S. Xiao, and M. Ding, “Detisseg: A dual-encoder network for tissue semantic segmentation of histopathology image,” Biomedical Signal Processing and Control, vol. 87, p. 105544, 2024.

Y. Chen, Y. Zhou, G. Chen, Y. Guo, Y. Lv, M. Ma, Z. Pei, and Z. Sun, “Segmentation of breast tubules in h&e images based on a dks-doubleu-net model,” BioMed Research International, vol. 2022, no. 1, p. 2961610, 2022.

Y. Xie, F. Xing, X. Shi, X. Kong, H. Su, and L. Yang, “Efficient and robust cell detection: A structured regression approach,” Medical image analysis, vol. 44, pp. 245–254, 2018.

Y.-R. Van Eycke, C. Balsat, L. Verset, O. Debeir, I. Salmon, and C. Decaestecker, “Segmentation of glandular epithelium in colorectal tumours to automatically compartmentalise ihc biomarker quantification: A deep learning approach,” Medical image analysis, vol. 49, pp. 35–45, 2018.

F. Mahmood, D. Borders, R. J. Chen, G. N. McKay, K. J. Salimian, A. Baras, and N. J. Durr, “Deep adversarial training for multi-organ nuclei segmentation in histopathology images,” IEEE transactions on medical imaging, vol. 39, no. 11, pp. 3257–3267, 2019.

K. Roy, D. Banik, G. K. Chan, O. Krejcar, and D. Bhattacharjee, “2pclpr: A two-phase clump profiler for segmentation of cancer cells in fluorescence microscopic images,” IEEE Transactions on Instrumentation and Measurement, vol. 72, pp. 1–14, 2023.

E. Hancer, M. Traoré, R. Samet, Z. Yıldırım, and N. Nemati, “An imbalance-aware nuclei segmentation methodology for h&e stained histopathology images,” Biomedical Signal Processing and Control, vol. 83, p. 104720, 2023.

H. Chang, L. A. Loss, P. T. Spellman, A. Borowsky, and B. Parvin, “Batch-invariant nuclear segmentation in whole mount histology sections,” in 2012 9th IEEE International Symposium on Biomedical Imaging (ISBI). IEEE, 2012, pp. 856–859.

Y. Lin, Z. Qu, H. Chen, Z. Gao, Y. Li, L. Xia, K. Ma, Y. Zheng, and K.-T. Cheng, “Nuclei segmentation with point annotations from pathology images via self-supervised learning and co-training,” Medical Image Analysis, vol. 89, p. 102933, 2023.

S. Kasturi, W. T. Tran, and A. Shenfield, “Accurate nuclei segmentation in breast cancer tumour biopsies,” in 2022 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB). IEEE, 2022, pp. 1–8.

H. Mittal and M. Saraswat, “An automatic nuclei segmentation method using intelligent gravitational search algorithm based superpixel clustering,” Swarm and Evolutionary Computation, vol. 45, pp. 15–32, 2019.

A. Mahbod, G. Schaefer, G. Dorffner, S. Hatamikia, R. Ecker, and I. Ellinger, “A dual decoder u-net-based model for nuclei instance segmentation in hematoxylin and eosin-stained histological images,” Frontiers in Medicine, vol. 9, p. 978146, 2022.

A. Kumar and M. Prateek, “Localization of nuclei in breast cancer using whole slide imaging system supported by morphological features and shape formulas,” Cancer management and research, pp. 4573–4583, 2020.

X. Wang, Y. Fang, S. Yang, D. Zhu, M. Wang, J. Zhang, K.-y. Tong, and X. Han, “A hybrid network for automatic hepatocellular carcinoma segmentation in h&e-stained whole slide images,” Medical Image Analysis, vol. 68, p. 101914, 2021.

R. Sharma and K. Sharma, “An optimal nuclei segmentation method based on enhanced multi-objective gwo,” Complex & Intelligent Systems, pp. 1–14, 2022.

J. Shu, J. Lei, Q. Gao, and Q. Zhang, “Combing colour detection and neural networks for gland detection,” in Proceedings of the 2nd International Conference on Artificial Intelligence and Pattern Recognition, 2019, pp. 33–36.

P. Rastogi, K. Khanna, and V. Singh, “Gland segmentation in colorectal cancer histopathological images using u-net inspired convolutional network,” Neural Computing and Applications, vol. 34, no. 7, pp. 5383–5395, 2022.

S. Doyle, F. Dal Canton, J. Wesseling, C. I. Sánchez, and J. Teuwen, “Mammary duct detection using self-supervised encoders,” in Medical Imaging 2022: Computer-Aided Diagnosis, vol. 12033. SPIE, 2022, pp. 229–232.

H. Naji, L. Sancere, A. Simon, R. Büttner, M.-L. Eich, P. Lohneis, and K. Bożek, “Holy-net: Segmentation of histological images of diffuse large b-cell lymphoma,” Computers in Biology and Medicine, vol. 170, p. 107978, 2024.

H. Zhang, J. Liu, P. Wang, Z. Yu, W. Liu, and H. Chen, “Cross-boosted multi-target domain adaptation for multi-modality histopathology image translation and segmentation,” IEEE Journal of Biomedical and Health Informatics, vol. 26, no. 7, pp. 3197–3208, 2022.

D. R. Chambers, B. B. Brimhall, D. R. Poole Jr, and E. A. Medina, “Cancer cell segmentation for cellularity prediction via a weakly labeled/strongly labeled hybrid convolutional neural network,” in Medical Imaging 2022: Digital and Computational Pathology, vol. 12039. SPIE, 2022, pp. 47–54.

A. Husham, M. Hazim Alkawaz, T. Saba, A. Rehman, and J. Saleh Alghamdi, “Automated nuclei segmentation of malignant using level sets,” Microscopy research and technique, vol. 79, no. 10, pp. 993–997, 2016.

R. Nateghi, H. Danyali, and M. S. Helfroush, “A deep learning approach for mitosis detection: application in tumor proliferation prediction from whole slide images,” Artificial intelligence in medicine, vol. 114, p. 102048, 2021.

K. S. Beevi, M. S. Nair, and G. Bindu, “Automatic mitosis detection in breast histopathology images using convolutional neural network based deep transfer learning,” Biocybernetics and Biomedical Engineering, vol. 39, no. 1, pp. 214–223, 2019.

G. I. Sayed and A. E. Hassanien, “Moth-flame swarm optimization with neutrosophic sets for automatic mitosis detection in breast cancer histology images,” Applied Intelligence, vol. 47, pp. 397–408, 2017.

S. F. H. Naqvi, S. Ayubi, A. Nasim, and Z. Zafar, “Automated gland segmentation leading to cancer detection for colorectal biopsy images,” in Advances in Information and Communication: Proceedings of the 2019 Future of Information and Communication Conference (FICC), Volume 2. Springer, 2020, pp. 75–83.

A. Mahbod, G. Schaefer, I. Ellinger, R. Ecker, Ö. Smedby, and C. Wang, “A two-stage u-net algorithm for segmentation of nuclei in h&e-stained tissues,” in Digital Pathology: 15th European Congress, ECDP 2019, Warwick, UK, April 10–13, 2019, Proceedings 15. Springer, 2019, pp. 75–82.

H. Ahn, Y. Hong, and K.-M. Kim, “Class-controlling copy-paste augmentation for nuclear segmentation,” in 2022 IEEE International Symposium on Biomedical Imaging Challenges (ISBIC). IEEE, 2022, pp. 1–4.

A. A. Aatresh, R. P. Yatgiri, A. K. Chanchal, A. Kumar, A. Ravi, D. Das, R. Bs, S. Lal, and J. Kini, “Efficient deep learning architecture with dimension-wise pyramid pooling for nuclei segmentation of histopathology images,” Computerized Medical Imaging and Graphics, vol. 93, p. 101975, 2021.

Z. Liu, Y. Cai, and Q. Tang, “Nuclei detection in breast histopathology images with iterative correction,” Medical & Biological Engineering & Computing, vol. 62, no. 2, pp. 465–478, 2024.

H. Wang, J. Yang, R. Katayama, M. Matsusaki, T. Miyao, and J. Zhou, “Nuclseg: nuclei segmentation using semi-supervised stain deconvolution,” in Proceedings of the 5th ACM International Conference on Multimedia in Asia, 2023, pp. 1–6.

W. Lou, H. Li, G. Li, X. Han, and X. Wan, “Which pixel to annotate: a label-efficient nuclei segmentation framework,” IEEE Transactions on Medical Imaging, vol. 42, no. 4, pp. 947–958, 2022.

G. M. Dogar, M. M. Fraz, and S. Javed, “Feature attention network for simultaneous nuclei instance segmentation and classification in histology images,” in 2021 International conference on digital futures and transformative technologies (ICoDT2). IEEE, 2021, pp. 1–6.

K. Yao, K. Huang, J. Sun, and A. Hussain, “Pointnu-net: Keypoint-assisted convolutional neural network for simultaneous multi-tissue histology nuclei segmentation and classification,” IEEE Transactions on Emerging Topics in Computational Intelligence, vol. 8, no. 1, pp. 802–813, 2023.

S. Roy, R. Khurana, and V. Jain, “Log-base2 of gaussian kernel for nuclei segmentation from colorectal cancer h and e-stained histopathology images,” in Fifteenth International Conference on Machine Vision (ICMV 2022), vol. 12701. SPIE, 2023, pp. 81–88.

“Kmc liver dataset,” [link], 2021, [Online; accessed 03-June-2025].

“Cryonuseg dataset,” [link], 2021, [Online; accessed 30-May-2025].

“Mthh dataset,” [link], 2022, [Online; accessed 01-June-2025].

“Lynsec dataset,” [link], 2024, [Online; accessed 02-June-2025].

“Tcga-brca dataset,” [link], [Online; accessed 10-June-2025].

“Miccai glas challenge database,” [link], 2022, [Online; accessed 01-June-2025].

“Paip dataset,” [link], 2019, [Online; accessed 14-June-2025].

“Uhcmc&cwru dataset,” [link], 2018, [Online; accessed 01-June-2025].

“Breastpathq dataset,” [link], 2019, [Online; accessed 01-June-2025].

“Amida 13 dataset,” [link], [Online; accessed 15-June-2025].

“Icpr12 dataset,” [link], [Online; accessed 15-June-2025].

“Mitos & atypia 14 database,” [link], 2014, [Online; accessed 31-May-2025].

“Tupac database,” [link], 2019, [Online; accessed 30-May-2025].

“Mitos wsi ccmct dataset,” [link], [Online; accessed 09-June-2025].

“Midog21 dataset,” [link], 2021, [Online; accessed 09-June-2025].

“Bach iciar,” [link], 2019, [Online; accessed 15-June-2025].

“Conic challenge dataset,” [link], 2022, [Online; accessed 01-June-2025].

“Consep dataset,” [link], [Online; accessed 15-June-2025].

“Cpm-17 dataset,” [link], 2019, [Online; accessed 01-June-2025].

“Monusac dataset,” [link], 2020, [Online; accessed 30-May-2025].

“Monuseg dataset,” [link], 2017, [Online; accessed 30-May-2025].

“Nucls dataset,” [link], [Online; accessed 02-June-2025].

“Nulnsseg dataset,” [link], 2022, [Online; accessed 30-May-2025].

“Pannuke dataset,” [link], [Online; accessed 01-June-2025].

“Tcga-kirc,” [link], [Online; accessed 01-June-2025].

“Tnbc dataset,” [link], 2018, [Online; accessed 30-May-2025].

“Ucsb database,” [link], 2020, [Online; accessed 01-June-2025].

“Benchmark dataset,” [link], [Online; accessed 01-June-2025].

“Local crc dataset,” [link], [Online; accessed 15-June-2025].

“Ocdc dataset,” [link], 2022, [Online; accessed 03-June-2025].

“Orca dataset,” [link], 2020, [Online; accessed 03-June-2025].

“Bca-lym dataset,” [link], [Online; accessed 01-June-2025].

“Post-nat-brca dataset,” [link], [Online; accessed 01-June-2025].

“Bcss dataset,” [link], [Online; accessed 01-June-2025].

“Luad-histoseg dataset,” [link], [Online; accessed 01-June-2025].

D. Müller, I. Soto-Rey, and F. Kramer, “Towards a guideline for evaluation metrics in medical image segmentation,” BMC Research Notes, vol. 15, no. 1, p. 210, 2022.
Publicado
30/09/2025
ROISSMANN, Anna Clara Medina; TOSTA, Thaína A. Azevedo. Segmentation is better when shared: a review of public H&E histological images datasets. In: WORKSHOP ON DIGITAL AND COMPUTATIONAL PATHOLOGY - CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 38. , 2025, Salvador/BA. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2025 . p. 353-359.