The abundance of dark matter (DM) subhalos orbiting a host galaxy is a generic prediction of the cosmological framework, and is a promising way to constrain the nature of DM. In this paper, we investigate the use of machine learning-based tools to quantify the magnitude of phase-space perturbations caused by the passage of DM subhalos. A simple binary classifier and an anomaly detection model are proposed to estimate if stars or star particles close to DM subhalos are statistically detectable in simulations. The simulated datasets are three Milky Way-like galaxies and nine synthetic Gaia DR2 surveys derived from these. Firstly, we find that the anomaly detection algorithm, trained on a simulated galaxy with full 6D kinematic observables and applied on another galaxy, is nontrivially sensitive to the DM subhalo population. On the other hand, the classification-based approach is not sufficiently sensitive due to the extremely low statistics of signal stars for supervised training. Finally, the sensitivity of both algorithms in the Gaia-like surveys is negligible. The enormous size of the Gaia dataset motivates the further development of scalable and accurate data analysis methods that could be used to select potential regions of interest for DM searches to ultimately constrain the Milky Way's subhalo mass function, as well as simulations where to study the sensitivity of such methods under different signal hypotheses.
翻译:在主机星系轨道上大量暗物质(DM)亚卤素环绕的暗物质(DM)亚卤素是宇宙框架的通用预测,是限制DM性质的一种有希望的方法。在本文中,我们调查了机器学习工具的使用,以量化DM子卤洛通过而导致的相位空间扰动的规模。建议了一个简单的二进制分类器和一个异常检测模型,以估计在模拟中,离DM亚卤罗斯近处的恒星或星粒是否在统计上可以检测。模拟数据集是三个类似银河的星系和从这些星系中得出的九项合成Gaia DR2调查。首先,我们发现,在模拟银河系上受训的反常识测算算法,在模拟中,对DMDM子星群群人口构成的不甚敏感度。另一方面,基于分类的方法不够敏感,因为用于监督培训的信号星的统计极低。最后,Gaia类调查中两种算法的敏感度是微不足道的。根据Gaia数据设置的庞大规模,可以促使在模拟中进一步开发具有6个运动观测功能的模拟的模拟的模拟系统,从而对MDMDMIS进行精确的精确分析,从而选择的潜在数据功能,从而选择了对MDMDMDMDRY的精确度进行大规模搜索的可能性。