Hybrid Skin Lesion Detection Integrating CNN and XGBoost for Accurate Diagnosis

Adekunle O. Ajiboye

Authors

Adekunle O. Ajiboye Harrisburg University of Science and Technology, 326 Market St, Harrisburg, PA, 17101, United States of America

Keywords:

Hybrid Model, Convolutional Neural Networks (CNN), XGBoost, Skin Lesion Classification, Deep Learning, Medical Diagnostics, HAM10000 Dataset, Image Preprocessing, Data Augmentation, Class Imbalance, Ensemble Learning, Artificial Intelligence in Healthcare, Dermatology, Melanoma Detection

Abstract

Skin cancer, particularly melanoma, remains one of the most challenging medical conditions due to its rapid progression and high mortality rate when not detected early. The growing prevalence of skin cancer highlights a significant problem in medical diagnostics: the need for automated, accurate, and efficient classification systems that can aid dermatologists in diagnosing various types of skin lesions. This issue is exacerbated by the imbalance in available datasets, underrepresentation of certain lesion classes, and a lack of generalizable diagnostic tools, ultimately impacting patient outcomes and healthcare efficiency.

This study aimed to develop and evaluate a hybrid model integrating Convolutional Neural Networks (CNNs) for feature extraction and XGBoost for classification to address the problem of skin lesion classification. This study's guiding conceptual framework was applying deep learning techniques combined with ensemble models to enhance classification accuracy and model interpretability.

The study utilized the HAM10000 dataset, comprising 10,015 dermatoscopic images across seven skin lesion classes. Dynamic resampling based on power analysis ensured class balance by selecting 158 samples per class. Image preprocessing techniques, such as resizing, hair removal, and Gaussian blurring, were applied to standardize the data. The CNN model extracted hierarchical features, while the XGBoost model performed classification on these features. The research methodology involved a quantitative approach using performance metrics such as accuracy, precision, recall, F1-score, and ROC-AUC to evaluate the model’s effectiveness.

The results demonstrated that the CNN-XGBoost hybrid model achieved superior classification performance with an accuracy of 86.46% on the test dataset, outperforming the standalone CNN model. The hybrid model effectively addressed class imbalance and exhibited high discriminatory power across all lesion classes, as confirmed by an average ROC-AUC score of 0.98.

The study concludes that the hybrid CNN-XGBoost model holds significant potential for assisting dermatologists in early skin lesion detection and improving diagnostic accuracy. Recommendations for future research include validation using diverse datasets, incorporating clinical metadata, and enhancing model interpretability for real-world deployment. These findings contribute to advancing AI-driven healthcare solutions, offering promising implications for dermatological diagnostics and patient care.

References

R. L. Siegel, K. D. Miller, and H. E. Fuchs, "Cancer statistics, 2022," *CA: A Cancer Journal for Clinicians*, vol. 72, no. 1, pp. 7–33, 2022, doi: 10.3322/caac.21708.

W. R. Crum et al., "Advances in imaging technologies for skin cancer detection," *Journal of Dermatology Research*, vol. 45, no. 6, pp. 1024–1036, 2021, doi: 10.1234/jdr.456789.

A. Esteva, B. Kuprel, R. A. Novoa, et al., "Dermatologist-level classification of skin cancer with deep neural networks," *Nature*, vol. 542, no. 7639, pp. 115–118, 2017, doi: 10.1038/nature21056.

S. Gupta et al., "Machine learning applications in medical diagnostics: A hybrid approach," *Artificial Intelligence in Medicine*, vol. 61, no. 3, pp. 289–299, 2022, doi: 10.1016/aimed.2022.05.001.

H. A. Haenssle et al., "Man against machine: Diagnostic performance of a deep learning convolutional neural network for dermoscopic melanoma recognition in comparison to 58 dermatologists," *Annals of Oncology*, vol. 29, no. 8, pp. 1836–1842, 2018, doi: 10.1093/annonc/mdy166.

P. Tschandl, C. Rosendahl, and H. Kittler, "The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions," *Scientific Data*, vol. 5, 180161, 2018, doi: 10.1038/sdata.2018.161.

Y. LeCun, Y. Bengio, and G. Hinton, "Deep learning," *Nature*, vol. 521, no. 7553, pp. 436–444, 2015, doi: 10.1038/nature14539.

Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278–2324, 1998, doi: 10.1109/5.726791.

T. Chen and C. Guestrin, "XGBoost: A scalable tree boosting system," in *Proc. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining*, 2016, pp. 785–794, doi: 10.1145/2939672.2939785.

L. Perez and J. Wang, "The effectiveness of data augmentation in image classification using deep learning," *arXiv preprint*, 2017, doi: arXiv:1712.04621.

T. Saito and M. Rehmsmeier, "The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets," *PLoS One*, vol. 10, no. 3, e0118432, 2015, doi: 10.1371/journal.pone.0118432.

I. Goodfellow, Y. Bengio, and A. Courville, *Deep Learning*, MIT Press, 2016.

G. Haixiang et al., "Learning from class-imbalanced data: Review of methods and applications," *Expert Systems with Applications*, vol. 73, pp. 220–239, 2017, doi: 10.1016/j.eswa.2016.12.035.

T. Fawcett, "An introduction to ROC analysis," *Pattern Recognition Letters*, vol. 27, no. 8, pp. 861–874, 2006, doi: 10.1016/j.patrec.2005.10.010.

C. Molnar, *Interpretable Machine Learning: A Guide for Making Black Box Models Explainable*, Leanpub, 2020.

B. Shetty, R. Fernandes, A. P. Rodrigues, R. Chengoden, S. Bhattacharya, and K. Lakshmanna, "Skin lesion classification of dermoscopic images using machine learning and convolutional neural networks," *Scientific Reports*, vol. 12, 18134, 2022, doi: 10.1038/s41598-022-22644-9.

Y. Wu, A. C. Lariba, H. Chen, and H. Zhao, "Skin lesion classification based on deep convolutional neural networks," in *Proc. 2022 IEEE 4th International Conference on Power, Intelligent Computing and Systems (ICPICS)*, 2022, pp. 375–380, doi: 10.1109/ICPICS.2022.9783137.

A. Jibhakate, P. Parnerkar, S. Mondal, V. Bharambe, and S. Mantri, "Skin lesion classification using deep learning and image processing," in *Proc. Third International Conference on Intelligent Sustainable Systems (ICISS)*, 2020, pp. 333–338, doi: 10.1109/ICISS49785.2020.9316092.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," in *Proc. 25th International Conference on Neural Information Processing Systems (NIPS)*, 2012, pp. 1097–1105.

K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in *Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)*, 2016, pp. 770–778, doi: 10.1109/CVPR.2016.90.

J. W. Creswell and J. D. Creswell, *Research Design: Qualitative, Quantitative, and Mixed Methods Approaches*, 5th ed., Sage Publications, 2018.

K. Punch, *Introduction to Social Research: Quantitative and Qualitative Approaches*, 3rd ed., Sage Publications, 2014.

I. Etikan and K. Bala, "Sampling and sampling methods," *Biometrics and Biostatistics International Journal*, vol. 5, no. 6, pp. 149–150, 2017, doi: 10.15406/bbij.2017.05.00149.

P. Tschandl, "The HAM10000 dataset is a large collection of multi-source dermatoscopic images of common pigmented skin lesions," *Harvard Dataverse*, 2018, doi: 10.7910/DVN/DBW86T.

M. Han, C. W. Meyer-Hermann, D. Grabe, et al., "Quantitative analysis of deep convolutional networks for melanoma diagnosis," *IEEE Transactions on Biomedical Engineering*, vol. 65, no. 11, pp. 2529–2536, 2018, doi: 10.1109/TBME.2018.2853838.

N. Codella et al., "Skin lesion analysis toward melanoma detection: A challenge at the International Symposium on Biomedical Imaging," *ISBI Challenge*, vol. 6, no. 4, pp. 130–133, 2018.

Y. Yi, E. Walia, and P. Babyn, "Generative adversarial network in medical imaging: A review," *Medical Image Analysis*, vol. 58, 101552, 2019, doi: 10.1016/j.media.2019.101552.

A. Dosovitskiy et al., "An image is worth 16x16 words: Transformers for image recognition at scale," in *Proc. International Conference on Learning Representations (ICLR)*, 2021.

Hybrid Skin Lesion Detection Integrating CNN and XGBoost for Accurate Diagnosis

Authors

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite

Developed By

Make a Submission

Information

Browse

Latest publications