In late December 2019, the novel coronavirus (Sars-Cov-2) and the resulting disease COVID-19 were first identified in Wuhan China. The disease slipped through containment measures, with the first known case in the United States being identified on January 20th, 2020. In this paper, we utilize survey data from the Inter-university Consortium for Political and Social Research and apply several statistical and machine learning models and techniques such as Decision Trees, Multinomial Logistic Regression, Naive Bayes, k-Nearest Neighbors, Support Vector Machines, Neural Networks, Random Forests, Gradient Tree Boosting, XGBoost, CatBoost, LightGBM, Synthetic Minority Oversampling, and Chi-Squared Test to analyze the impacts the COVID-19 pandemic has had on the mental health of frontline workers in the United States. Through the interpretation of the many models applied to the mental health survey data, we have concluded that the most important factor in predicting the mental health decline of a frontline worker is the healthcare role the individual is in (Nurse, Emergency Room Staff, Surgeon, etc.), followed by the amount of sleep the individual has had in the last week, the amount of COVID-19 related news an individual has consumed on average in a day, the age of the worker, and the usage of alcohol and cannabis.
翻译:2019年12月下旬,中国武汉首次发现新科罗纳病毒(Sars-Cov-2)以及由此而来的COVID-19疾病。该疾病通过遏制措施而滑落,2020年1月20日查明了美国首例已知病例。在本文中,我们利用大学间政治和社会研究联合会的调查数据,运用了若干统计和机器学习模型和技术,如决定树、多民族物流倒退、Nive Bayes、K-Nearest Nieghbors、支持病媒机器、神经网络、随机森林、“重树”运动、XGBost、卡特博斯、灯GBM、合成少数群体过度采样和奇斯堪称试验,以分析COVI-19大流行病对美国前线工人心理健康的影响。我们通过对适用于心理健康调查数据的许多模型的解释,得出结论,预测一线工人心理健康下降的最重要因素是个人保健作用、随机森林、重树、XOBost、个人睡眠使用率在个人睡眠年龄上的比例。