Early outbreak detection is a key aspect in the containment of infectious diseases, as it enables the identification and isolation of infected individuals before the disease can spread to a larger population. Instead of detecting unexpected increases of infections by monitoring confirmed cases, syndromic surveillance aims at the detection of cases with early symptoms, which allows a more timely disclosure of outbreaks. However, the definition of these disease patterns is often challenging, as early symptoms are usually shared among many diseases and a particular disease can have several clinical pictures in the early phase of an infection. To support epidemiologists in the process of defining reliable disease patterns, we present a novel, data-driven approach to discover such patterns in historic data. The key idea is to take into account the correlation between indicators in a health-related data source and the reported number of infections in the respective geographic region. In an experimental evaluation, we use data from several emergency departments to discover disease patterns for three infectious diseases. Our results suggest that the proposed approach is able to find patterns that correlate with the reported infections and often identifies indicators that are related to the respective diseases.
翻译:早期爆发检测是遏制传染病的一个关键方面,因为早期爆发检测能够使感染者在疾病传播到更多人口之前就能够识别和孤立感染者。与其通过监测确诊病例来发现意外增加的感染病例,还不如以早期症状检测为目的,从而更及时地披露疾病爆发情况。然而,这些疾病模式的定义往往具有挑战性,因为早期症状通常由许多疾病共同存在,特定疾病在感染的早期阶段可以有几张临床图片。为了支持流行病学家在确定可靠疾病模式的过程中,我们提出了一个新的、以数据为驱动的方法,在历史数据中发现这种模式。关键的想法是考虑与健康有关的数据来源的指标与各自地理区域报告的感染人数之间的相互关系。在一项实验性评估中,我们利用几个紧急部门的数据来发现三种传染病的疾病模式。我们的结果表明,拟议的方法能够找到与所报告的感染相关的模式,并常常确定与各自疾病相关的指标。