We leverage state-of-the-art machine learning methods and a decade's worth of archival data from CFHT to predict observatory image quality (IQ) from environmental conditions and observatory operating parameters. Specifically, we develop accurate and interpretable models of the complex dependence between data features and observed IQ for CFHT's wide-field camera, MegaCam. Our contributions are several-fold. First, we collect, collate and reprocess several disparate data sets gathered by CFHT scientists. Second, we predict probability distribution functions (PDFs) of IQ and achieve a mean absolute error of $\sim0.07''$ for the predicted medians. Third, we explore the data-driven actuation of the 12 dome "vents" installed in 2013-14 to accelerate the flushing of hot air from the dome. We leverage epistemic and aleatoric uncertainties in conjunction with probabilistic generative modeling to identify candidate vent adjustments that are in-distribution (ID); for the optimal configuration for each ID sample, we predict the reduction in required observing time to achieve a fixed SNR. On average, the reduction is $\sim12\%$. Finally, we rank input features by their Shapley values to identify the most predictive variables for each observation. Our long-term goal is to construct reliable and real-time models that can forecast optimal observatory operating parameters to optimize IQ. We can then feed such forecasts into scheduling protocols and predictive maintenance routines. We anticipate that such approaches will become standard in automating observatory operations and maintenance by the time CFHT's successor, the Maunakea Spectroscopic Explorer, is installed in the next decade.
翻译:我们利用CFHT最先进的机器学习方法和价值十年的档案数据,从环境条件和观测站运行参数中预测观测站图像质量(IQ),具体地说,我们开发了数据特征和观测到的CFHT广域相机“MegaCam”的IQ之间复杂依赖性准确和可解释的模型。我们的贡献有几倍。首先,我们收集、整理和重新处理由CFHT科学家收集的数套不同数据集。第二,我们预测IQ的概率分布功能(PDFs),从环境条件和观测站运行参数预测中得出绝对差值为$\sim0.7'。第三,我们探索了2013-14年安装的12个圆顶点“vents”之间复杂的数据依赖性和可解释性IQQ,以加速将热空气冲洗出该圆顶点。我们利用感应感应和感知的不确定性模型来确定正在分配的候选口服调整(ID),然后成为每个ID样本的最佳配置,我们预测到需要观测时间的减少时间以达到最终的SNR值。