We leverage state-of-the-art machine learning methods and a decade's worth of archival data from the Canada-France-Hawaii Telescope (CFHT) to predict observatory image quality (IQ) from environmental conditions and observatory operating parameters. Specifically, we develop accurate and interpretable models of the complex dependence between data features and observed IQ for CFHT's wide field camera, MegaCam. Our contributions are several-fold. First, we collect, collate and reprocess several disparate data sets gathered by CFHT scientists. Second, we predict probability distribution functions (PDFs) of IQ, and achieve a mean absolute error of $\sim0.07''$ for the predicted medians. Third, we explore data-driven actuation of the 12 dome ``vents'', installed in 2013-14 to accelerate the flushing of hot air from the dome. We leverage epistemic and aleatoric uncertainties in conjunction with probabilistic generative modeling to identify candidate vent adjustments that are in-distribution (ID) and, for the optimal configuration for each ID sample, we predict the reduction in required observing time to achieve a fixed SNR. On average, the reduction is $\sim15\%$. Finally, we rank sensor data features by Shapley values to identify the most predictive variables for each observation. Our long-term goal is to construct reliable and real-time models that can forecast optimal observatory operating parameters for optimization of IQ. Such forecasts can then be fed into scheduling protocols and predictive maintenance routines. We anticipate that such approaches will become standard in automating observatory operations and maintenance by the time CFHT's successor, the Maunakea Spectroscopic Explorer (MSE), is installed in the next decade.
翻译:我们利用加拿大-法国-哈瓦伊望远镜(CFHT)十年来最先进的机器学习方法和价值的档案数据,从环境条件和观测运行参数中预测观测台图像质量(IQ),具体地说,我们开发了数据特征之间复杂依赖的准确和可解释模型,并观测了CFHT广域摄影机MegaCam的IQ。我们的贡献是多方面的。首先,我们收集、整理和重新处理由CFHT科学家收集的若干不同数据集。第二,我们预测IQ的概率分布函数(PDFs),从环境条件和观测观测运行参数中得出绝对误差为$\sim0.7'。第三,我们开发了数据驱动模型,显示数据驱动器数据在数据上,2013-14年安装了12个“venet'。我们利用感应感知和感知的不确定性与预测模型一起,确定候选人口调整在发送(ID)中,而对于每个运行量的预测值而言,最优化的OFMER值在运行模型中将达到最佳的运行速度,我们最终将用SIMSIMIMO数据来预测。