This work introduces Jacobian-scaled K-means (JSK-means) clustering, which is a physics-informed clustering strategy centered on the K-means framework. The method allows for the injection of underlying physical knowledge into the clustering procedure through a distance function modification: instead of leveraging conventional Euclidean distance vectors, the JSK-means procedure operates on distance vectors scaled by matrices obtained from dynamical system Jacobians evaluated at the cluster centroids. The goal of this work is to show how the JSK-means algorithm -- without modifying the input dataset -- produces clusters that capture regions of dynamical similarity, in that the clusters are redistributed towards high-sensitivity regions in phase space and are described by similarity in the source terms of samples instead of the samples themselves. The algorithm is demonstrated on a complex reacting flow simulation dataset (a channel detonation configuration), where the dynamics in the thermochemical composition space are known through the highly nonlinear and stiff Arrhenius-based chemical source terms. Interpretations of cluster partitions in both physical space and composition space reveal how JSK-means shifts clusters produced by standard K-means towards regions of high chemical sensitivity (e.g., towards regions of peak heat release rate near the detonation reaction zone). The findings presented here illustrate the benefits of utilizing Jacobian-scaled distances in clustering techniques, and the JSK-means method in particular displays promising potential for improving former partition-based modeling strategies in reacting flow (and other multi-physics) applications.
翻译:本研究介绍了基于雅可比矩阵缩放的K均值聚类(JSK-means),这是一个中心在K均值框架下的物理驱动聚类策略。该方法通过修改距离函数允许将底层的物理知识注入到聚类过程中:JSK均值聚类不使用传统的欧几里得距离向量,而是操作在通过在聚类中心处求解的指标系统雅可比矩阵进行缩放的距离向量上。本文的目标是表明JSK均值算法,在不修改输入数据集的情况下,能够产生捕获动态相似性区域的聚类,即聚类对扰动后的相空间高敏度区域进行重新分配,并且由样本的源项相似性描述。在复杂的反应流仿真数据集(一个通道爆轰配置)上演示了算法,其中热化学组成空间的动态通过高度非线性和硬化的阿伦尼乌斯化学源项得知。在物理空间和组成空间中对聚类分区的解释显示JSK均值聚类将通过标准K均值产生的聚类移向高化学灵敏度区域(例如在爆轰反应区域附近的峰值放热率区域)。这里呈现的发现说明了在聚类技术中使用雅可比矩阵缩放距离的好处,特别是JSK均值方法在反应流(和其他多物理)应用上显示出有前途的潜力,以改善以前基于分区的建模策略。