In this work, we introduce CPLASS (Continuous Piecewise-Linear Approximation via Stochastic Search), an algorithm for detecting changes in velocity within multidimensional data. The one-dimensional version of this problem is known as the change-in-slope problem (see Fearnhead & Grose (2022), Baranowski et al. (2019)). Unlike traditional changepoint detection methods that focus on changes in mean, detecting changes in velocity requires a specialized approach due to continuity constraints and parameter dependencies, which frustrate popular algorithms like binary segmentation and dynamic programming. To overcome these difficulties, we introduce a specialized penalty function to balance improvements in likelihood due to model complexity, and a Markov Chain Monte Carlo (MCMC)-based approach with tailored proposal mechanisms for efficient parameter exploration. Our method is particularly suited for analyzing intracellular transport data, where the multidimensional trajectories of microscale cargo are driven by teams of molecular motors that undergo complex biophysical transitions. To ensure biophysical realism in the results, we introduce a speed penalty that discourages overfitted of short noisy segments while maintaining consistency in the large-sample limit. Additionally, we introduce a summary statistic called the Cumulative Speed Allocation, which is robust with respect to idiosyncracies of changepoint detection while maintaining the ability to discriminate between biophysically distinct populations.
翻译:本文提出CPLASS(基于随机搜索的连续分段线性逼近)算法,用于检测多维数据中的速度变化。该问题的一维形式被称为斜率变化问题(参见Fearnhead & Grose (2022)、Baranowski et al. (2019))。与专注于均值变化的传统变点检测方法不同,由于连续性约束和参数依赖性的存在,速度变化检测需要专门的方法,这使得二分分割和动态规划等常用算法难以适用。为克服这些困难,我们引入了一种专门设计的惩罚函数来平衡模型复杂度带来的似然改进,并采用基于马尔可夫链蒙特卡洛(MCMC)的方法,配合定制化的提议机制以实现高效的参数探索。本方法特别适用于分析细胞内运输数据,其中微尺度货物的多维轨迹由经历复杂生物物理转变的分子马达团队驱动。为确保结果的生物物理真实性,我们引入了速度惩罚项,以抑制对短噪声段的过拟合,同时保持大样本极限下的一致性。此外,我们提出了一种称为累积速度分配的汇总统计量,该统计量对变点检测的特异性具有鲁棒性,同时保持区分生物物理特性不同群体的能力。