Out-of-distribution (OOD) detection is critical to ensuring the reliability and safety of machine learning systems. For instance, in autonomous driving, we would like the driving system to issue an alert and hand over the control to humans when it detects unusual scenes or objects that it has never seen during training time and cannot make a safe decision. The term, OOD detection, first emerged in 2017 and since then has received increasing attention from the research community, leading to a plethora of methods developed, ranging from classification-based to density-based to distance-based ones. Meanwhile, several other problems, including anomaly detection (AD), novelty detection (ND), open set recognition (OSR), and outlier detection (OD), are closely related to OOD detection in terms of motivation and methodology. Despite common goals, these topics develop in isolation, and their subtle differences in definition and problem setting often confuse readers and practitioners. In this survey, we first present a unified framework called generalized OOD detection, which encompasses the five aforementioned problems, i.e., AD, ND, OSR, OOD detection, and OD. Under our framework, these five problems can be seen as special cases or sub-tasks, and are easier to distinguish. We then review each of these five areas by summarizing their recent technical developments, with a special focus on OOD detection methodologies. We conclude this survey with open challenges and potential research directions.
翻译:例如,在自主驾驶中,我们希望驾驶系统在发现培训期间从未见过的异常场景或物体时,向人类发出警报,并将控制权交给发现异常场景或物体的人,而这种场景或物体是培训期间从未见过的,无法作出安全的决定。2017年首次出现的OOD探测这一术语首次出现,此后研究界日益关注,导致开发了大量方法,从基于分类的检测到基于密度的检测到基于远程的检测。与此同时,其他一些问题,包括异常探测、新发现(ND)、公开确认(OSR)和外部探测(OD),在动机和方法方面都与OOD探测密切相关。尽管共同目标是孤立地发展这些专题,在定义和问题方面的微妙差异往往使读者和从业人员感到困惑。在这次调查中,我们首先提出了一个统一框架,称为通用的OD检测,其中包括上述五个问题,即AD、ND、OD检测、OD检测和OD。在我们的框架下,这五个问题都与最近的研究方向或分级研究领域相比,我们可以将这些研究领域中的每个特殊领域和分级分析领域,我们随后将这些研究领域作为研究领域和分级研究重点。