Interstellar dust corrupts nearly every stellar observation, and accounting for it is crucial to measuring physical properties of stars. We model the dust distribution as a spatially varying latent field with a Gaussian process (GP) and develop a likelihood model and inference method that scales to millions of astronomical observations. Modeling interstellar dust is complicated by two factors. The first is integrated observations. The data come from a vantage point on Earth and each observation is an integral of the unobserved function along our line of sight, resulting in a complex likelihood and a more difficult inference problem than in classical GP inference. The second complication is scale; stellar catalogs have millions of observations. To address these challenges we develop ziggy, a scalable approach to GP inference with integrated observations based on stochastic variational inference. We study ziggy on synthetic data and the Ananke dataset, a high-fidelity mechanistic model of the Milky Way with millions of stars. ziggy reliably infers the spatial dust map with well-calibrated posterior uncertainties.
翻译:星际尘埃几乎腐蚀了每颗恒星的观测,并且对它进行衡算对于测量恒星的物理特性至关重要。我们用高山进程(GP)将灰尘分布模拟成一个空间上差异的潜伏场,并开发一种比例为百万天文观测的概率模型和推断方法。建模星际尘埃因两个因素而复杂化。第一个因素是综合观测。数据来自地球的方位和每个观测都是我们视线上未观测的函数的有机组成部分,导致一个复杂的可能性和比古典GP推断中更困难的推论问题。第二个复杂程度是规模;恒星目录有数以百万计的观测数据。为了应对这些挑战,我们开发了轨迹模型和推论,一种根据星变变变变变率综合观测的可缩缩缩放方法。我们研究了合成数据和Ananke数据集,这是几百万颗恒星的高不振感应感的模型。