Using Machine Learning for Identifying COVID-19
Keywords:
Machine Learning; Multilayer Perceptron; Random Forest; COVID-19; Supervised LearningAbstract
In late December 2019, an outbreak of the novel coronavirus, known as COVID-19, originated in Wuhan, China, ultimately evolving into a global pandemic. This study focuses on the application of two distinct machine learning approaches to predict COVID-19 presence in individuals. The dataset employed for analysis was obtained from clients who sought medical attention at Israelita Albert Einstein Hospital in São Paulo, Brazil. During their hospital visits, samples were collected for COVID-19 and additional laboratory tests. Specifically, we utilized supervised learning techniques, namely multilayer perceptron and random forest, to conduct our analysis. The performance of these models was evaluated using the Area under the ROC Curve (AUC), resulting in respective AUC values of 97% and 92.4% for multilayer perceptron and random forest methods.
References
Day, M. (2020). Covid-19: identifying and isolating asymptomatic people helped eliminate virus in Italian village. BMJ: British Medical Journal (Online), 368.
Ai, T., Yang, Z., Hou, H., Zhan, C., Chen, C., Lv, W., ... & Xia, L. (2020). Correlation of chest CT and RT-PCR testing in coronavirus disease 2019 (COVID-19) in China: a report of 1014 cases. Radiology, 200642.
Apostolopoulos, I. D., & Mpesiana, T. A. (2020). Covid-19: automatic detection from x-ray images utilizing transfer learning with convolutional neural networks. Physical and Engineering Sciences in Medicine, 1.
Weinstock, M. B., Echenique, A. N. A., DABR, J. W. R., Leib, A., & ILLUZZI, F. A. (2020). Chest x-ray findings in 636 ambulatory patients with COVID-19 presenting to an urgent care center: a normal chest x-ray is no guarantee. J Urgent Care Med, 14(7), 13-8.
Cabitza, F., & Zeitoun, J. D. (2019). The proof of the pudding: in praise of a culture of real-world validation for medical artificial intelligence. Annals of translational medicine, 7(8).
Lai, C. C., Liu, Y. H., Wang, C. Y., Wang, Y. H., Hsueh, S. C., Yen, M. Y., ... & Hsueh, P. R. (2020). Asymptomatic carrier state, acute respiratory disease, and pneumonia due to severe acute respiratory syndrome coronavirus 2 (SARSCoV-2): facts and myths. Journal of Microbiology, Immunology and Infection.
Wang, D., Hu, B., Hu, C., Zhu, F., Liu, X., Zhang, J., ... & Zhao, Y. (2020). Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus–infected pneumonia in Wuhan, China. Jama, 323(11), 1061-1069.
Gao, Y., Li, T., Han, M., Li, X., Wu, D., Xu, Y., ... & Wang, L. (2020). Diagnostic utility of clinical laboratory data determinations for patients with the severe COVID‐19. Journal of medical virology.
Liu, Y., Yang, Y., Zhang, C., Huang, F., Wang, F., Yuan, J., ... & Zhang, Z. (2020). Clinical and biochemical indexes from 2019-nCoV infected patients linked to viral loads and lung injury. Science China Life Sciences, 63(3), 364-374.
Bayrak, E. A., Kırcı, P., & Ensari, T. (2019, April). Comparison of machine learning methods for breast cancer diagnosis. In 2019 Scientific Meeting on Electrical-Electronics & Biomedical Engineering and Computer Science (EBBT) (pp. 1-3). IEEE.
Wu, C. C., Yeh, W. C., Hsu, W. D., Islam, M. M., Nguyen, P. A. A., Poly, T. N., ... & Li, Y. C. J. (2019). Prediction of fatty liver disease using machine learning algorithms. Computer methods and programs in biomedicine, 170, 23-29.
Paul, D., Su, R., Romain, M., Sébastien, V., Pierre, V., & Isabelle, G. (2017). Feature selection for outcome prediction in oesophageal cancer using genetic algorithm and random forest classifier. Computerized Medical Imaging and Graphics, 60, 42-49.
Lei, Y., Wang, T., Harms, J., Shafai-Erfani, G., Tian, S., Higgins, K., ... & Liu, T. (2019, March). MRI-based pseudo CT generation using classification and regression random forest. In Medical Imaging 2019: Physics of Medical Imaging (Vol. 10948, p. 1094843). International Society for Optics and Photonics.
Varoquaux, Gaël, and Veronika Cheplygina. "Machine learning for medical imaging: methodological failures and recommendations for the future." NPJ digital medicine 5.1 (2022): 48.
Ahmad, Ghulab Nabi, et al. "Efficient medical diagnosis of human heart diseases using machine learning techniques with and without GridSearchCV." IEEE Access 10 (2022): 80151-80173.
Bacha, Sawssen, and Okba Taouali. "A novel machine learning approach for breast cancer diagnosis." Measurement 187 (2022): 110233.
Wu, J., Zhang, P., Zhang, L., Meng, W., Li, J., Tong, C., ... & Zhao, M. (2020). Rapid and accurate identification of COVID-19 infection through machine learning based on clinical available blood test results. medRxiv.
Brinati, D., Campagner, A., Ferrari, D., Locatelli, M., Banfi, G., & Cabitza, F. (2020). Detection of COVID-19 Infection from Routine Blood Exams with Machine Learning: a Feasibility Study. medRxiv.
Pfaff, Emily R., et al. "Identifying who has long COVID in the USA: a machine learning approach using N3C data." The Lancet Digital Health 4.7 (2022): e532-e541.
Chadaga, Krishnaraj, et al. "Clinical and laboratory approach to diagnose COVID-19 using machine learning." Interdisciplinary sciences: Computational life sciences 14.2 (2022): 452-470.
Alyasseri, Zaid Abdi Alkareem, et al. "Review on COVID‐19 diagnosis models based on machine learning and deep learning approaches." Expert systems 39.3 (2022): e12759.
Souza T, Gustavo WS and Souza HSP, (2020). COVID-19 Machine Learning-Based Rapid Diagnosis From Common Laboratory Tests, Online Published at: https://www.kaggle.com/felixthecat/covid-19-ml-policy-based-rapiddiagnosis.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 ILMA University
This work is licensed under a Creative Commons Attribution 4.0 International License.