Clinical Features of Emergency Department Patients from Early COVID-19 Pandemic that Predict SARS-CoV-2 Infection: Machine-learning Approach

Eric H. Chou; Chih Hung Wang; Yu Lin Hsieh; Babak Namazi; Jon Wolfshohl; Toral Bhakta; Chu Lin Tsai; Wan Ching Lien; Ganesh Sankaranarayanan; Chien Chang Lee; Tsung Chien Lu

doi:10.5811/WESTJEM.2020.12.49370

Clinical Features of Emergency Department Patients from Early COVID-19 Pandemic that Predict SARS-CoV-2 Infection: Machine-learning Approach

Eric H. Chou, Chih Hung Wang, Yu Lin Hsieh, Babak Namazi, Jon Wolfshohl, Toral Bhakta, Chu Lin Tsai, Wan Ching Lien, Ganesh Sankaranarayanan, Chien Chang Lee, Tsung Chien Lu

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

Introduction: Within a few months coronavirus disease 2019 (COVID-19) evolved into a pandemic causing millions of cases worldwide, but it remains challenging to diagnose the disease in a timely fashion in the emergency department (ED). In this study we aimed to construct machine-learning (ML) models to predict severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) infection based on the clinical features of patients visiting an ED during the early COVID-19 pandemic. Methods: We retrospectively collected the data of all patients who received reverse transcriptase polymerase chain reaction (RT-PCR) testing for SARS-CoV-2 at the ED of Baylor Scott & White All Saints Medical Center, Fort Worth, from February 23-May 12, 2020. The variables collected included patient demographics, ED triage data, clinical symptoms, and past medical history. The primary outcome was the confirmed diagnosis of COVID-19 (or SARS-CoV-2 infection) by a positive RT-PCR test result for SARS-CoV-2, and was used as the label for ML tasks. We used univariate analyses for feature selection, and variables with P<0.1 were selected for model construction. Samples were split into training and testing cohorts on a 60:40 ratio chronologically. We tried various ML algorithms to construct the best predictive model, and we evaluated performances with the area under the receiver operating characteristic curve (AUC) in the testing cohort. Results: A total of 580 ED patients were tested for SARS-CoV-2 during the study periods, and 98 (16.9%) were identified as having the SARS-CoV-2 infection based on the RT-PCR results. Univariate analyses selected 21 features for model construction. We assessed three ML methods for performance: Of the three methods, random forest outperformed the others with the best AUC result (0.86), followed by gradient boosting (0.83) and extra trees classifier (0.82). Conclusion: This study shows that it is feasible to use ML models as an initial screening tool for identifying patients with SARS-CoV-2 infection. Further validation will be necessary to determine how effectively this prediction model can be used prospectively in clinical practice. [West J Emerg Med. 2021;22(2)244-251.].

Original language	English (US)
Pages (from-to)	244-251
Number of pages	8
Journal	Western Journal of Emergency Medicine
Volume	22
Issue number	2
DOIs	https://doi.org/10.5811/WESTJEM.2020.12.49370
State	Published - Mar 2021
Externally published	Yes

ASJC Scopus subject areas

Emergency Medicine

Access to Document

10.5811/WESTJEM.2020.12.49370

Cite this

Chou, E. H., Wang, C. H., Hsieh, Y. L., Namazi, B., Wolfshohl, J., Bhakta, T., Tsai, C. L., Lien, W. C., Sankaranarayanan, G., Lee, C. C., & Lu, T. C. (2021). Clinical Features of Emergency Department Patients from Early COVID-19 Pandemic that Predict SARS-CoV-2 Infection: Machine-learning Approach. Western Journal of Emergency Medicine, 22(2), 244-251. https://doi.org/10.5811/WESTJEM.2020.12.49370

Chou, EH, Wang, CH, Hsieh, YL, Namazi, B, Wolfshohl, J, Bhakta, T, Tsai, CL, Lien, WC, Sankaranarayanan, G, Lee, CC & Lu, TC 2021, 'Clinical Features of Emergency Department Patients from Early COVID-19 Pandemic that Predict SARS-CoV-2 Infection: Machine-learning Approach', Western Journal of Emergency Medicine, vol. 22, no. 2, pp. 244-251. https://doi.org/10.5811/WESTJEM.2020.12.49370

@article{e9271377db0f45c09a61ba884479bd1f,

title = "Clinical Features of Emergency Department Patients from Early COVID-19 Pandemic that Predict SARS-CoV-2 Infection: Machine-learning Approach",

abstract = "Introduction: Within a few months coronavirus disease 2019 (COVID-19) evolved into a pandemic causing millions of cases worldwide, but it remains challenging to diagnose the disease in a timely fashion in the emergency department (ED). In this study we aimed to construct machine-learning (ML) models to predict severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) infection based on the clinical features of patients visiting an ED during the early COVID-19 pandemic. Methods: We retrospectively collected the data of all patients who received reverse transcriptase polymerase chain reaction (RT-PCR) testing for SARS-CoV-2 at the ED of Baylor Scott & White All Saints Medical Center, Fort Worth, from February 23-May 12, 2020. The variables collected included patient demographics, ED triage data, clinical symptoms, and past medical history. The primary outcome was the confirmed diagnosis of COVID-19 (or SARS-CoV-2 infection) by a positive RT-PCR test result for SARS-CoV-2, and was used as the label for ML tasks. We used univariate analyses for feature selection, and variables with P<0.1 were selected for model construction. Samples were split into training and testing cohorts on a 60:40 ratio chronologically. We tried various ML algorithms to construct the best predictive model, and we evaluated performances with the area under the receiver operating characteristic curve (AUC) in the testing cohort. Results: A total of 580 ED patients were tested for SARS-CoV-2 during the study periods, and 98 (16.9%) were identified as having the SARS-CoV-2 infection based on the RT-PCR results. Univariate analyses selected 21 features for model construction. We assessed three ML methods for performance: Of the three methods, random forest outperformed the others with the best AUC result (0.86), followed by gradient boosting (0.83) and extra trees classifier (0.82). Conclusion: This study shows that it is feasible to use ML models as an initial screening tool for identifying patients with SARS-CoV-2 infection. Further validation will be necessary to determine how effectively this prediction model can be used prospectively in clinical practice. [West J Emerg Med. 2021;22(2)244-251.].",

author = "Chou, {Eric H.} and Wang, {Chih Hung} and Hsieh, {Yu Lin} and Babak Namazi and Jon Wolfshohl and Toral Bhakta and Tsai, {Chu Lin} and Lien, {Wan Ching} and Ganesh Sankaranarayanan and Lee, {Chien Chang} and Lu, {Tsung Chien}",

year = "2021",

month = mar,

doi = "10.5811/WESTJEM.2020.12.49370",

language = "English (US)",

volume = "22",

pages = "244--251",

journal = "Western Journal of Emergency Medicine",

issn = "1936-900X",

publisher = "University of California",

number = "2",

}

TY - JOUR

T1 - Clinical Features of Emergency Department Patients from Early COVID-19 Pandemic that Predict SARS-CoV-2 Infection

T2 - Machine-learning Approach

AU - Chou, Eric H.

AU - Wang, Chih Hung

AU - Hsieh, Yu Lin

AU - Namazi, Babak

AU - Wolfshohl, Jon

AU - Bhakta, Toral

AU - Tsai, Chu Lin

AU - Lien, Wan Ching

AU - Sankaranarayanan, Ganesh

AU - Lee, Chien Chang

AU - Lu, Tsung Chien

PY - 2021/3

Y1 - 2021/3

N2 - Introduction: Within a few months coronavirus disease 2019 (COVID-19) evolved into a pandemic causing millions of cases worldwide, but it remains challenging to diagnose the disease in a timely fashion in the emergency department (ED). In this study we aimed to construct machine-learning (ML) models to predict severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) infection based on the clinical features of patients visiting an ED during the early COVID-19 pandemic. Methods: We retrospectively collected the data of all patients who received reverse transcriptase polymerase chain reaction (RT-PCR) testing for SARS-CoV-2 at the ED of Baylor Scott & White All Saints Medical Center, Fort Worth, from February 23-May 12, 2020. The variables collected included patient demographics, ED triage data, clinical symptoms, and past medical history. The primary outcome was the confirmed diagnosis of COVID-19 (or SARS-CoV-2 infection) by a positive RT-PCR test result for SARS-CoV-2, and was used as the label for ML tasks. We used univariate analyses for feature selection, and variables with P<0.1 were selected for model construction. Samples were split into training and testing cohorts on a 60:40 ratio chronologically. We tried various ML algorithms to construct the best predictive model, and we evaluated performances with the area under the receiver operating characteristic curve (AUC) in the testing cohort. Results: A total of 580 ED patients were tested for SARS-CoV-2 during the study periods, and 98 (16.9%) were identified as having the SARS-CoV-2 infection based on the RT-PCR results. Univariate analyses selected 21 features for model construction. We assessed three ML methods for performance: Of the three methods, random forest outperformed the others with the best AUC result (0.86), followed by gradient boosting (0.83) and extra trees classifier (0.82). Conclusion: This study shows that it is feasible to use ML models as an initial screening tool for identifying patients with SARS-CoV-2 infection. Further validation will be necessary to determine how effectively this prediction model can be used prospectively in clinical practice. [West J Emerg Med. 2021;22(2)244-251.].

AB - Introduction: Within a few months coronavirus disease 2019 (COVID-19) evolved into a pandemic causing millions of cases worldwide, but it remains challenging to diagnose the disease in a timely fashion in the emergency department (ED). In this study we aimed to construct machine-learning (ML) models to predict severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) infection based on the clinical features of patients visiting an ED during the early COVID-19 pandemic. Methods: We retrospectively collected the data of all patients who received reverse transcriptase polymerase chain reaction (RT-PCR) testing for SARS-CoV-2 at the ED of Baylor Scott & White All Saints Medical Center, Fort Worth, from February 23-May 12, 2020. The variables collected included patient demographics, ED triage data, clinical symptoms, and past medical history. The primary outcome was the confirmed diagnosis of COVID-19 (or SARS-CoV-2 infection) by a positive RT-PCR test result for SARS-CoV-2, and was used as the label for ML tasks. We used univariate analyses for feature selection, and variables with P<0.1 were selected for model construction. Samples were split into training and testing cohorts on a 60:40 ratio chronologically. We tried various ML algorithms to construct the best predictive model, and we evaluated performances with the area under the receiver operating characteristic curve (AUC) in the testing cohort. Results: A total of 580 ED patients were tested for SARS-CoV-2 during the study periods, and 98 (16.9%) were identified as having the SARS-CoV-2 infection based on the RT-PCR results. Univariate analyses selected 21 features for model construction. We assessed three ML methods for performance: Of the three methods, random forest outperformed the others with the best AUC result (0.86), followed by gradient boosting (0.83) and extra trees classifier (0.82). Conclusion: This study shows that it is feasible to use ML models as an initial screening tool for identifying patients with SARS-CoV-2 infection. Further validation will be necessary to determine how effectively this prediction model can be used prospectively in clinical practice. [West J Emerg Med. 2021;22(2)244-251.].

UR - http://www.scopus.com/inward/record.url?scp=85103756231&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85103756231&partnerID=8YFLogxK

U2 - 10.5811/WESTJEM.2020.12.49370

DO - 10.5811/WESTJEM.2020.12.49370

M3 - Article

C2 - 33856307

AN - SCOPUS:85103756231

SN - 1936-900X

VL - 22

SP - 244

EP - 251

JO - Western Journal of Emergency Medicine

JF - Western Journal of Emergency Medicine

IS - 2

ER -

Clinical Features of Emergency Department Patients from Early COVID-19 Pandemic that Predict SARS-CoV-2 Infection: Machine-learning Approach

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this