The COVID-19 pandemic has caused an unprecedented global public health crisis. Given its inherent nature, social distancing measures are proposed as the primary strategies to curb the spread of this pandemic. Therefore, identifying situations where these protocols are violated, has implications for curtailing the spread of the disease and promoting a sustainable lifestyle. This paper proposes a novel computer vision-based system to analyze CCTV footage to provide a threat level assessment of COVID-19 spread. The system strives to holistically capture and interpret the information content of CCTV footage spanning multiple frames to recognize instances of various violations of social distancing protocols, across time and space, as well as identification of group behaviors. This functionality is achieved primarily by utilizing a temporal graph-based structure to represent the information of the CCTV footage and a strategy to holistically interpret the graph and quantify the threat level of the given scene. The individual components are tested and validated on a range of scenarios and the complete system is tested against human expert opinion. The results reflect the dependence of the threat level on people, their physical proximity, interactions, protective clothing, and group dynamics. The system performance has an accuracy of 76%, thus enabling a deployable threat monitoring system in cities, to permit normalcy and sustainability in the society.
翻译:COVID-19大流行已造成前所未有的全球公共卫生危机,鉴于其内在性质,提议采取社会偏移措施,作为遏制这一流行病蔓延的主要战略,因此,查明违反这些协议的情况,对遏制疾病蔓延和促进可持续生活方式产生影响;本文件提议建立一个新型的计算机视觉系统,分析闭路电视镜头,对COVID-19大传播进行威胁程度评估;该系统力求全面捕捉和解释覆盖多个框架的闭路电视片段的信息内容,以识别各种违反社会偏移协议的事件,以及查明群体行为;这一功能的实现主要通过使用时间图结构来代表闭路电视镜头的信息,以及全面解释图表和量化特定场景威胁程度的战略;个别组成部分在一系列假设中进行测试和验证,整个系统根据人类专家意见进行测试;其结果反映了威胁程度对人、其身体相近、互动、防护服装和群体动态的依赖性。系统运行精确度为76%,从而能够使城市的可部署威胁监测系统能够正常地进行社会。