Identification of chemically similar stars using elemental abundances is core to many pursuits within Galactic archaeology. However, measuring the chemical likeness of stars using abundances directly is limited by systematic imprints of imperfect synthetic spectra in abundance derivation. We present a novel data-driven model that is capable of identifying chemically similar stars from spectra alone. We call this Relevant Scaled Component Analysis (RSCA). RSCA finds a mapping from stellar spectra to a representation that optimizes recovery of known open clusters. By design, RSCA amplifies factors of chemical abundance variation and minimizes those of non-chemical parameters, such as instrument systematics. The resultant representation of stellar spectra can therefore be used for precise measurements of chemical similarity between stars. We validate RSCA using 185 cluster stars in 22 open clusters in the APOGEE survey. We quantify our performance in measuring chemical similarity using a reference set of 151,145 field stars. We find that our representation identifies known stellar siblings more effectively than stellar abundance measurements. Using RSCA, 1.8% of pairs of field stars are as similar as birth siblings, compared to 2.3% when using stellar abundance labels. We find that almost all of the information within spectra leveraged by RSCA fits into a two-dimensional basis, which we link to [Fe/H] and alpha-element abundances. We conclude that chemical tagging of stars to their birth clusters remains prohibitive. However, using the spectra has noticeable gain, and our approach is poised to benefit from larger datasets and improved algorithm designs.
翻译:利用元素丰度来识别化学上相似的恒星是银河考古学中许多追求的核心内容。然而,通过直接利用丰度来测量恒星的化学相似性,由于在丰度衍生中不完美的合成光谱的系统印迹而受到限制。我们展示了一种新的数据驱动模型,能够仅从光谱中识别化学上相似的恒星。我们称之为“相关比例成份分析 ” ( RSCA) 。我们从恒星光谱中找到一个映像图,以优化已知开放群群的恢复。通过设计,RSCA放大了化学丰度变异因素,并尽量减少非化学参数(如仪器系统)的化学相似性。因此,恒星光光谱的生成面表示可以被用来精确测量星体之间的化学相似性。我们用APOGEEE调查中的22个开放组中的185个集星群验证了我们测量化学上相似性的表现,我们用151,145个外地恒星群的显示我们所知道的星系比星系丰度测量得更有效率。我们发现,1.8%的野生恒星系数据比仪器系统系统系统系统系统系统化的精确度测量。1.8%,使用生恒星系作为生的星系的星系的恒度数据,但我们通过SLMRBILMMMMMM)的精确链接中发现,我们找到了两个恒系的恒系的比。