Astronomers require efficient automated detection and classification pipelines when conducting large-scale surveys of the (optical) sky for variable and transient sources. Such pipelines are fundamentally important, as they permit rapid follow-up and analysis of those detections most likely to be of scientific value. We therefore present a deep learning pipeline based on the convolutional neural network architecture called $\texttt{MeerCRAB}$. It is designed to filter out the so called 'bogus' detections from true astrophysical sources in the transient detection pipeline of the MeerLICHT telescope. Optical candidates are described using a variety of 2D images and numerical features extracted from those images. The relationship between the input images and the target classes is unclear, since the ground truth is poorly defined and often the subject of debate. This makes it difficult to determine which source of information should be used to train a classification algorithm. We therefore used two methods for labelling our data (i) thresholding and (ii) latent class model approaches. We deployed variants of $\texttt{MeerCRAB}$ that employed different network architectures trained using different combinations of input images and training set choices, based on classification labels provided by volunteers. The deepest network worked best with an accuracy of 99.5$\%$ and Matthews correlation coefficient (MCC) value of 0.989. The best model was integrated to the MeerLICHT transient vetting pipeline, enabling the accurate and efficient classification of detected transients that allows researchers to select the most promising candidates for their research goals.
翻译:在对可变和短暂来源的(光学)天空进行大规模调查时,天文学家需要高效的自动检测和分类管道。这种管道具有根本重要性,因为它们允许对最有可能具有科学价值的探测进行快速跟踪和分析。因此,我们展示了以名为$\textt{MeerCRAB} 的进化神经网络结构为基础的深层次学习管道。因此,我们用两种方法将数据(i) 阈值和(ii) 潜值级模型方法中所谓的“机器人”探测从真正的天体物理来源中过滤出来。我们使用从这些图像中提取的各种2D图像和数字特征来描述光学候选人。输入图像与目标类别之间的关系不清楚,因为地面真相定义不甚明确,而且往往是辩论的主题。因此难以确定哪些信息来源应用于分类算法。因此,我们用两种方法将数据(i) 阈值和(ii) 潜值模型模型和(ii) 级模型模型模型模型方法。我们运用了最有前途的变异种变量来描述这些候选人。使用不同的网络结构进行了最精确的研究目标,用不同组合图像的模型和最精确的模型进行了培训。