Distinguishing COVID-19 from other flu-like illnesses can be difficult due to ambiguous symptoms and still an initial experience of doctors. Whereas, it is crucial to filter out those sick patients who do not need to be tested for SARS-CoV-2 infection, especially in the event of the overwhelming increase in disease. As a part of the presented research, logistic regression and XGBoost classifiers, that allow for effective screening of patients for COVID-19, were generated. Each of the methods was tuned to achieve an assumed acceptable threshold of negative predictive values during classification. Additionally, an explanation of the obtained classification models was presented. The explanation enables the users to understand what was the basis of the decision made by the model. The obtained classification models provided the basis for the DECODE service (decode.polsl.pl), which can serve as support in screening patients with COVID-19 disease. Moreover, the data set constituting the basis for the analyses performed is made available to the research community. This data set consisting of more than 3,000 examples is based on questionnaires collected at a hospital in Poland.
翻译:将COVID-19与其他类似流感的疾病区别开来,可能由于症状模糊不清,而且仍然是医生的最初经历而困难重重;虽然必须筛选那些不需要就SARS-COV-2感染进行检测的病人,特别是在疾病大量增加的情况下;作为提出的研究、后勤回归和XGBoost分类方法的一部分,产生了能够有效筛查COVID-19病人的CGBOost分类方法;每种方法都经过调整,以便在分类期间达到一个可接受的负预测值阈值;此外,还介绍了对所获得的分类模型的解释;这些解释使用户能够了解模型所作决定的依据;获得的分类模型为DECODE服务提供了基础(decode.polsl.pl),该服务可以作为检查COVID-19疾病病人的支持;此外,构成分析基础的数据集提供给了研究界;这套数据由3,000多个实例组成,以波兰医院收集的问卷为基础。