Developing an effective automatic classifier to separate genuine sources from artifacts is essential for transient follow-ups in wide-field optical surveys. The identification of transient detections from the subtraction artifacts after the image differencing process is a key step in such classifiers, known as real-bogus classification problem. We apply a self-supervised machine learning model, the deep-embedded self-organizing map (DESOM) to this "real-bogus" classification problem. DESOM combines an autoencoder and a self-organizing map to perform clustering in order to distinguish between real and bogus detections, based on their dimensionality-reduced representations. We use 32x32 normalized detection thumbnails as the input of DESOM. We demonstrate different model training approaches, and find that our best DESOM classifier shows a missed detection rate of 6.6% with a false positive rate of 1.5%. DESOM offers a more nuanced way to fine-tune the decision boundary identifying likely real detections when used in combination with other types of classifiers, for example built on neural networks or decision trees. We also discuss other potential usages of DESOM and its limitations.
翻译:开发一个有效的自动分类器,将真正的来源从文物中分离出来,这对于在广域光学调查中进行临时跟踪至关重要。在图像差异化过程之后,从减值文物中识别从减值文物中识别瞬时检测是这类分类器中的一个关键步骤,称为真实博格斯分类问题。我们用一个自监督的机器学习模型,即深层嵌入的自我组织地图(DESOM),来应对“真实博格”分类问题。DESOM将自动编码器和自组织地图结合起来,进行集成,以便根据真实和模糊的特征来区分真实和模糊的检测。我们使用32x32的常规检测缩略图作为DESOM的投入。我们展示了不同的示范培训方法,发现我们最好的DESOM分类仪显示6.6%的误差检测率为1.5%。DESOM提供了一种更细化的方法,用以微调整确定与其他类别分类器组合使用时可能真实的检测结果,例如,用于其神经网络或决定树上的限制。