Ensemble of learning transfer models with GAN for data augmentation: Application to skin cancer detection
DOI:
https://doi.org/10.29327/2520355.14.4-9Palavras-chave:
Convolutional Neural Network, Transfer Learning, Ensemble, Data augmentationResumo
Deep Learning models typically require large datasets to achieve high performance, which can be challenging when applied to medical data due to the difficulty in acquiring sufficient observations, particularly when the data is imbalanced between categories like pathological and normal cases. To address this, approaches like transfer learning, ensemble methods, and data augmentation through Generative Adversarial Networks (GANs) can be utilized to improve classification tasks on small datasets. In situations with imbalanced data, resampling techniques are also crucial. This research combines transfer learning with resampling methods to boost the prediction accuracy of minority class samples in small, imbalanced datasets. Additionally, techniques like data retracing and GAN-based augmentation are applied. The dataset used includes small, imbalanced images of skin cancer, aimed at classifying them as malignant or benign. The findings reveal that models using resampling techniques achieve better results, while those without resampling underperform. This underscores the benefit of resampling in enhancing prediction, particularly for the minority class. Furthermore, using GANs for data augmentation improves model performance over those that do not incorporate this technique.
Referências
Al-Rasheed, A., Ksibi, A., Ayadi, M., Alzahrani, A. I., Zakariah, M., and Ali Hakami, N. (2022). An ensemble of transfer learning models for the prediction of skin cancers with conditional generative adversarial networks. Diagnostics, 12(12):3145.
Ali-Gombe, A. and Elyan, E. (2019). Mfc-gan: Class-imbalanced dataset classification using multiple fake class generative adversarial network. Neurocomputing, 361:212–221.
Alrashedy, H. H. N., Almansour, A. F., Ibrahim, D. M., and Hammoudeh, M. A. A. (2022). Braingan: Brain mri image generation and classification framework using gan architectures and cnn models. Sensors, 22(11):4297.
Alsaif, H., Guesmi, R., Alshammari, B. M., Hamrouni, T., Guesmi, T., Alzamil, A., and Belguesmi, L. (2022). A novel data augmentation-based brain tumor detection using convolutional neural network. Applied Sciences, 12(8):3773.
Anjos, B. H. L. d. et al. (2020). Predcgan: uma abordagem para geração de nódulos pulmonares sintéticos usando pré-treinamento. Master’s thesis, Universidade Federal de Alagoas, Maceió, AL.
Baldine, R. B., Fonseca, K. V. d. S., and Ferreira, E. B. (2024). Deep learning classification of apple leaf diseases: comparison of neural networks. Sigmae: Revista Eletrônica da Estatística, 13(5).
Bharathi Raja, N. and Selvi Rajendran, P. (2023). An efficient banana plant leaf disease classification using optimal ensemble deep transfer network. Journal of Experimental & Theoretical Artificial Intelligence, pages 1–24.
Chakraborty, T. and Chakraborty, A. K. (2020). Superensemble classifier for improving predictions in imbalanced datasets. Communications in Statistics: Case Studies, Data Analysis and Applications, 6(2):123–141.
Chatterjee, S., Hazra, D., Byun, Y.-C., and Kim, Y.-W. (2022). Enhancement of image classification using transfer learning and gan-based synthetic data augmentation. Mathematics, 10(9):1541.
Chen, Y., Yang, X.-H., Wei, Z., Heidari, A. A., Zheng, N., Li, Z., Chen, H., Hu, H., Zhou, Q., and Guan, Q. (2022). Generative adversarial networks in medical image augmentation: a review. Computers in Biology and Medicine, page 105382.
Chollet, F. (2017). Xception: Deep learning with depthwise separable convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1251–1258.
Codella, N. C., Nguyen, Q.-B., Pankanti, S., Gutman, D. A., Helba, B., Halpern, A. C., and Smith, J. R. (2017). Deep learning ensembles for melanoma recognition in dermoscopy images. IBM Journal of Research and Development, 61(4/5):5–1.
Dai, W., Li, D., Tang, D., Wang, H., and Peng, Y. (2022). Deep learning approach for defective spot welds classification using small and class-imbalanced datasets. Neurocomputing, 477:46–60.
Ding, H., Sun, Y., Wang, Z., Huang, N., Shen, Z., and Cui, X. (2023). Rgan-el: A gan and ensemble learning-based hybrid approach for imbalanced data classification. Information Processing & Management, 60(2):103235.
Ferreira, G. A. and Suzuki, A. K. (2024). Adaptations of extreme gradient boosting for imbalanced datasets with application in credit scoring. Sigmae: Revista Eletrônica da Estatística, 13(4).
Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep Learning. MIT Press. http://www.deeplearningbook.org.
GOOGLE (2017). Google colaboratory. https://colab.research.google.com/. Acesso em: 6 jun. 2025.
Gutman, D., Codella, N. C., Celebi, E., Helba, B., Marchetti, M., Mishra, N., and Halpern, A. (2016). Skin lesion analysis toward melanoma detection: A challenge at the international symposium on biomedical imaging (isbi) 2016, hosted by the international skin imaging collaboration (isic). arXiv preprint arXiv:1605.01397.
Hashemi, S. R., Salehi, S. S. M., Erdogmus, D., Prabhu, S. P., Warfield, S. K., and Gholipour, A. (2018). Asymmetric loss functions and deep densely-connected networks for highly-imbalanced medical image segmentation: Application to multiple sclerosis lesion detection. IEEE Access, 7:1721–1735.
He, K., Zhang, X., Ren, S., and Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778.
Howard, A. G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861.
Huang, G., Liu, Z., Van Der Maaten, L., and Weinberger, K. Q. (2017). Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4700–4708.
Huang, Y., Jin, Y., Li, Y., and Lin, Z. (2020). Towards imbalanced image classification: a generative adversarial network ensemble learning method. IEEE Access, 8:88399–88409.
Ju, J., Zheng, H., Xu, X., Guo, Z., Zheng, Z., and Lin, M. (2022). Classification of jujube defects in small data sets based on transfer learning. Neural Computing and Applications, pages 1–14.
Khalifa, N. E., Loey, M., and Mirjalili, S. (2022). A comprehensive survey of recent trends in deep learning for digital images augmentation. Artificial Intelligence Review, pages 1–27.
Lee, J. and Park, K. (2021). Gan-based imbalanced data intrusion detection system. Personal and Ubiquitous Computing, 25:121–128.
Lee, N., Yang, H., and Yoo, H. (2021). A surrogate loss function for optimization of f beta score in binary classification with imbalanced data. arXiv preprint arXiv:2104.01459.
Lopez, A. R., Giro-i Nieto, X., Burdick, J., and Marques, O. (2017). Skin lesion classification from dermoscopic images using deep learning techniques. In 2017 13th IASTED international conference on biomedical engineering (BioMed), pages 49–54. IEEE.
Majtner, T., Yildirim-Yayilgan, S., and Hardeberg, J. Y. (2016). Combining deep learning and hand-crafted features for skin lesion classification. pages 1–6.
Manjunath, S. M., Gurjar, M., O’Kane, N., McCarren, A., and Gualano, L. (2022). Detection of covid 19 from an imbalanced chest x-ray image data set.
Mirza, M. and Osindero, S. (2014). Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784.
Mukhin, A. V., Kilbas, I. A., Paringer, R. A., Ilyasova, N. Y., and Kupriyanov, A. V. (2022). A method for balancing a multi-labeled biomedical dataset. Integrated Computer-Aided Engineering, (Preprint):1–17.
Nanni, L., Brahnam, S., Paci, M., and Ghidoni, S. (2022). Comparison of different convolutional neural network activation functions and methods for building ensembles for small to midsize medical data sets. Sensors, 22(16):6129.
Roy, S., Tyagi, M., Bansal, V., and Jain, V. (2022). Svd-clahe boosting and balanced loss function for covid-19 detection from an imbalanced chest x-ray dataset. Computers in Biology and Medicine, 150:106092.
Samee, N. A., Atteia, G., Meshoul, S., Al-antari, M. A., and Kadah, Y. M. (2022). Deep learning cascaded feature selection framework for breast cancer classification: Hybrid cnn with univariate-based approach. Mathematics, 10(19):3631.
Saravanan, T., Karthiha, K., Kavinkumar, R., Gokul, S., and Mishra, J. P. (2022). A novel machine learning scheme for face mask detection using pretrained convolutional neural network. Materials Today: Proceedings, 58:150–156.
Sarkar, D., Bali, R., and Ghosh, T. (2018). Hands-On Transfer Learning with Python: Implement advanced deep learning and neural network models using TensorFlow and Keras. Packt Publishing Ltd.
Szegedy, C., Ioffe, S., Vanhoucke, V., and Alemi, A. A. (2017). Inception-v4, inception-resnet and the impact of residual connections on learning. In Thirty-first AAAI conference on artificial intelligence.
Talatian Azad, S., Ahmadi, G., and Rezaeipanah, A. (2022). An intelligent ensemble classification method based on multi-layer perceptron neural network and evolutionary algorithms for breast cancer diagnosis. Journal of Experimental & Theoretical Artificial Intelligence, 34(6):949–969.
Vallabhajosyula, S., Sistla, V., and Kolli, V. K. K. (2022). Transfer learning-based deep ensemble neural network for plant leaf disease detection. Journal of Plant Diseases and Protection, 129(3):545–558.
von Wangenheim, A. (2018). Deep learning::reconhecimento de imagens. http://www.lapix.ufsc.br/ensino/visao/visao-computacionaldeep-learning/deep-learningreconhecimento-de-imagens#Modelos_de_VGG. acessado: 20-06-2019.
Zheng, Y., Li, C., Zhou, X., Chen, H., Xu, H., Li, Y., Zhang, H., Li, X., Sun, H., Huang, X., et al. (2022). Application of transfer learning and ensemble learning in image-level classification for breast histopathology. Intelligent Medicine.
Zhou, Q., Ren, C., and Qi, S. (2020). An imbalanced r-stdp learning rule in spiking neural networks for medical image classification. IEEE Access, 8:224162–224177.
Downloads
Publicado
Como Citar
Edição
Seção
Licença
Proposta de Política para Periódicos de Acesso Livre
Autores que publicam nesta revista concordam com os seguintes termos:
- Autores mantém os direitos autorais e concedem à revista o direito de primeira publicação, com o trabalho simultaneamente licenciado sob a Licença Creative Commons Attribution que permite o compartilhamento do trabalho com reconhecimento da autoria e publicação inicial nesta revista.
- Autores têm autorização para assumir contratos adicionais separadamente, para distribuição não-exclusiva da versão do trabalho publicada nesta revista (ex.: publicar em repositório institucional ou como capítulo de livro), com reconhecimento de autoria e publicação inicial nesta revista.
- Autores têm permissão e são estimulados a publicar e distribuir seu trabalho online (ex.: em repositórios institucionais ou na sua página pessoal) a qualquer ponto antes ou durante o processo editorial, já que isso pode gerar alterações produtivas, bem como aumentar o impacto e a citação do trabalho publicado (Veja O Efeito do Acesso Livre).