The upcoming large scale surveys like LSST are expected to find approximately $10^5$ strong gravitational lenses by analysing data of many orders of magnitude larger than those in contemporary astronomical surveys. In this case, non-automated techniques will be highly challenging and time-consuming, even if they are possible at all. We propose a new automated architecture based on the principle of self-attention to find strong gravitational lenses. The advantages of self-attention-based encoder models over convolution neural networks are investigated, and ways to optimise the outcome of encoder models are analysed. We constructed and trained 21 self-attention based encoder models and five convolution neural networks to identify gravitational lenses from the Bologna Lens Challenge. Each model was trained separately using 18,000 simulated images, cross-validated using 2,000 images, and then applied to a test set with 100,000 images. We used four different metrics for evaluation: classification accuracy, area under the receiver operating characteristic curve (AUROC), the TPR$_0$ score and the TPR$_{10}$ score. The performances of self-attention-based encoder models and CNNs participating in the challenge are compared. They were able to surpass the CNN models that participated in the Bologna Lens Challenge by a high margin for the $TPR_0$ and $TPR_${10}$. Self-Attention based models have clear advantages compared to simpler CNNs. They have highly competing performance in comparison to the currently used residual neural networks. Compared to CNNs, self-attention based models can identify highly confident lensing candidates and will be able to filter out potential candidates from real data. Moreover, introducing the encoder layers can also tackle the over-fitting problem present in the CNNs by acting as effective filters.
翻译:即将到来的大型调查,如LSST, 预计将通过分析比当代天文调查中的数据规模大得多的许多数量级的数据,找到大约 10 5 5 美元的强大引力镜。 在这种情况下, 非自动技术将具有高度的挑战性和耗时性,即使这些技术是可能的。 我们提议了一个新的自动化结构, 其依据的原则是自觉寻找强大的引力镜; 调查了以自我注意为基础的电离层模型相对于 convolution 神经网络的优势,并分析了优化编码模型结果的方法。 我们建造并培训了21个以自我注意为基础的编码模型模型和5个共振动神经网络,以识别博洛尼亚激光挑战中的引力透镜。 我们提出一个新的自动化结构, 以自我注意为基础, 以强重重重重重力透析镜; 我们使用四种不同的衡量标准来进行评估: 将离心电离心电离子电离子网络的准确度, 以TR$ 进行更精确的网络的比值比值为0.0, 和参加高振动的网络中的TRCRCO 10 参与度测试。