Ensemble Architectures and Efficient Fusion Techniques for Convolutional Neural Networks: An Analysis on Resource Optimization Strategies

Resumo


The human gastrointestinal tract is prone to various abnormalities, including lethal diseases such as cancer, necessitating better endoscopic performance and standardized screening. Endoscopic scoring systems lack generalizability, emphasizing the need for artificial intelligence-based solutions. Using the HyperKvasir dataset, we employed deep learning, specifically Convolutional Neural Networks, or shortly CNNs, to analyze endoscopic images and videos. Our study focused on improving the classification of gastrointestinal tract diseases by proposing various CNN ensembles and fusion techniques. Through the use of seven CNN models and effective merging techniques, we achieved enhanced performance. Validation involved literature review and experiments. DenseNet-161 influenced the merger process, and integrating ResNet152 and VGG further enhanced effectiveness. Resource analysis included GPU model, RAM usage, and execution time. Results demonstrated comparable performance to the previous model, with F1-score of 0.910 and Matthews correlation coefficient, MCC for short, of 0.902, using 10 GB GPU RAM (compared to 15.8 GB). With 24.7 GB GPU RAM, F1-score of 0.913 and MCC of 0.905 were achieved. These findings advance our understanding of ensemble architectures and fusion techniques.
Publicado
25/09/2023
COSTA, Cícero L.; LIMA, Danielli A.; BARCELOS, Celia A. Zorzo; TRAVENÇOLO, Bruno A. N.. Ensemble Architectures and Efficient Fusion Techniques for Convolutional Neural Networks: An Analysis on Resource Optimization Strategies. In: BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 12. , 2023, Belo Horizonte/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023 . p. 107-121. ISSN 2643-6264.