We introduce an algorithmic method for population anomaly detection based on gaussianization through an adversarial autoencoder. This method is applicable to detection of `soft' anomalies in arbitrarily distributed highly-dimensional data. A soft, or population, anomaly is characterized by a shift in the distribution of the data set, where certain elements appear with higher probability than anticipated. Such anomalies must be detected by considering a sufficiently large sample set rather than a single sample. Applications include, but not limited to, payment fraud trends, data exfiltration, disease clusters and epidemics, and social unrests. We evaluate the method on several domains and obtain both quantitative results and qualitative insights.
翻译:我们采用一种算法方法,通过对抗性自动编码器检测人口异常现象,该方法适用于在任意分布的高度维度数据中检测“软”异常现象,软数据或人口异常现象的特点是数据集分布发生变化,其中某些要素的概率高于预期,这种异常现象必须通过考虑足够大的样本组而不是单一样本来检测,应用包括但不限于付款欺诈趋势、数据过滤、疾病群和流行病以及社会动乱。我们评估了几个领域的方法,并获得了定量结果和定性洞察力。