Covid-19 Detecting in Computed Tomography Lungs Images using Machine and transfer Learning

Dalila Cherifi, Abderraouf Djaber, Mohammed-Elfateh Guedouar, Amine Feghoul, Zahia Zineb Chelbi, Amazigh Ait Ouakli

Abstract


Coronavirus disease 2019 (COVID-19) is a fast-spreading infectious disease that causes pneumonia; it has killed millions of people around the world and has had a significant impact on public healthcare. Diagnostic approaches fall into two broad categories, laboratory-based testing and chest radiography, and CT imaging has shown some advantages over the other methods in predicting infection. Because of limited medical capacity and the sharp rise in suspected cases, there is a need for a fast, accurate and automated method that reduces the diagnostic workload of radiologists. To this end, our work develops machine learning and deep learning algorithms that classify chest CT scans into Covid and non-Covid classes. The classifier must be highly accurate so that patients get a clear idea of their state, and many hyperparameters can be tuned to improve the performance of the models used to identify such illnesses. We worked on two different datasets from different sources, a small one of 746 images and a larger one of 14486 images. We evaluated several machine learning models: an SVM with different kernel types, a KNN model with different distance measures, and an RF model with two different numbers of trees. In addition, we developed two CNN-based approaches: one in which a single convolution layer is followed by a pooling layer, and one in which two consecutive convolution layers are followed by a single pooling layer each time. The machine learning models performed better than the CNNs on the small dataset, while on the large dataset the CNNs outperformed them.
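As an illustration of what varying the distance measure in a KNN model means, the following is a minimal sketch in plain Python. It is not the authors' implementation: the function names, the two-dimensional feature vectors and the labels are hypothetical, and real CT images would first have to be converted into feature vectors.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Straight-line (L2) distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """City-block (L1) distance between two feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def knn_predict(train_X, train_y, query, k=3, distance=euclidean):
    """Predict a label by majority vote among the k nearest training points."""
    ranked = sorted(zip(train_X, train_y),
                    key=lambda pair: distance(pair[0], query))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two well-separated clusters of 2-D features.
train_X = [(0.10, 0.20), (0.20, 0.10), (0.15, 0.25),
           (0.90, 0.80), (0.80, 0.90), (0.85, 0.75)]
train_y = ["non-Covid", "non-Covid", "non-Covid",
           "Covid", "Covid", "Covid"]

# The same query is classified under each distance measure.
pred_l2 = knn_predict(train_X, train_y, (0.8, 0.8), k=3, distance=euclidean)
pred_l1 = knn_predict(train_X, train_y, (0.8, 0.8), k=3, distance=manhattan)
```

On data this cleanly separated both measures agree; on real image features the choice of distance can change which neighbours are selected, which is why it is treated as a hyperparameter.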
To further improve performance, transfer learning was also used: we trained the pre-trained InceptionV3 and ResNet50V2 models on the same datasets. Among all the examined classifiers, ResNet50V2 achieved the best scores, with 86.67% accuracy, 93.94% sensitivity, 81% specificity and 86% F1-score on the small dataset; the respective scores on the large dataset were 97.52%, 97.28%, 97.77% and 98%. The experimental results suggest that the ResNet50V2 transfer learning approach is potentially applicable in real diagnostic scenarios, where it could be very useful for fast COVID-19 testing.
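The four reported metrics all derive from the counts of a binary confusion matrix. The sketch below shows the standard definitions, assuming Covid is the positive class; the function name and the example counts are hypothetical and are not taken from the paper's experiments.

```python
def binary_metrics(tp, fp, tn, fn):
    """Compute accuracy, sensitivity, specificity and F1 from confusion counts.

    tp/fn: Covid scans classified correctly / missed.
    tn/fp: non-Covid scans classified correctly / flagged as Covid.
    """
    def safe_div(num, den):
        # Return 0.0 instead of raising when a class is absent.
        return num / den if den else 0.0

    accuracy = safe_div(tp + tn, tp + fp + tn + fn)
    sensitivity = safe_div(tp, tp + fn)   # also called recall
    specificity = safe_div(tn, tn + fp)
    precision = safe_div(tp, tp + fp)
    f1 = safe_div(2 * precision * sensitivity, precision + sensitivity)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f1": f1,
    }


# Hypothetical counts, chosen only to make the arithmetic easy to follow.
metrics = binary_metrics(tp=45, fp=10, tn=40, fn=5)
```

High sensitivity with lower specificity, as reported on the small dataset, means few Covid cases are missed at the cost of more false alarms among non-Covid scans.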







DOI: https://doi.org/10.31449/inf.v47i8.4258

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.