Identification of experimentally acquired mass spectra of unknown compounds presents a~particular challenge because reliable spectral databases do not cover the potential chemical space with sufficient density. Therefore machine learning based \emph{de-novo} methods, which derive molecular structure directly from its mass spectrum gained attention recently. We present a~novel method in this family, addressing a~specific usecase of GC-EI-MS spectra, which is particularly hard due to lack of additional information from the first stage of MS/MS experiments, on which the previously published methods rely. We analyze strengths and drawbacks or our approach and discuss future directions.
翻译:摘要:鉴定实验采集的未知化合物的质谱图面临特殊挑战,因为可靠的光谱数据库未能以足够密度覆盖潜在的化学空间。因此,近来引起了人们的关注的基于机器学习的\emph{ de-novo }方法,可以直接从其质谱获得分子结构。我们在这个家族中提出了一种新颖的方法,解决GC-EI-MS谱图的一个特定应用场景,由于缺乏来自MS/MS实验的第一阶段的附加信息,因此特别困难,先前发布的方法依赖于该信息。我们分析了我们的方法的优缺点,并讨论了未来的方向。