The ocean is filled with microscopic microalgae called phytoplankton, which together are responsible for as much photosynthesis as all plants on land combined. Our ability to predict their response to the warming ocean relies on understanding how the dynamics of phytoplankton populations is influenced by changes in environmental conditions. One powerful technique to study the dynamics of phytoplankton is flow cytometry, which measures the optical properties of thousands of individual cells per second. Today, oceanographers are able to collect flow cytometry data in real-time onboard a moving ship, providing them with fine-scale resolution of the distribution of phytoplankton across thousands of kilometers. One of the current challenges is to understand how these small and large scale variations relate to environmental conditions, such as nutrient availability, temperature, light and ocean currents. In this paper, we propose a novel sparse mixture of multivariate regressions model to estimate the time-varying phytoplankton subpopulations while simultaneously identifying the specific environmental covariates that are predictive of the observed changes to these subpopulations. We demonstrate the usefulness and interpretability of the approach using both synthetic data and real observations collected on an oceanographic cruise conducted in the north-east Pacific in the spring of 2017.
翻译:海洋中充斥着称为浮游植物的微型微藻类,它们共同负责与陆地上所有植物加在一起的光合作用。我们预测它们对变暖海洋的反应的能力取决于了解浮游植物群的动态如何受到环境条件变化的影响。研究浮游植物动态的一种强大技术是测流细胞测量法,测量每秒数千个单细胞的光学特性。今天,海洋学家能够在移动的船上实时收集流动的细胞测量数据,向它们提供数千公里间浮游植物分布的精确分辨率。目前的挑战之一是了解这些大小变化如何与环境条件(如营养物的可得性、温度、光和洋流)相关。在本文件中,我们提出了一种新颖的多变化回归模型混合物,以估计浮游植物在时间变化的亚人口,同时确定正在预测这些亚人口所观察到的变化的具体环境变量。我们用合成数据以及201717年期在东北进行的海洋航行方法中,我们展示了在采集的合成数据及对2017年期航行中进行的海洋航行方法的实用性和可解释性。