Spatiotemporal data mining (STDM) has a wide range of applications in various complex physical systems (CPS), i.e., transportation, manufacturing, healthcare, etc. Among all the proposed methods, the Convolutional Long Short-Term Memory (ConvLSTM) has proved to be generalizable and extendable in different applications and has multiple variants achieving state-of-the-art performance in various STDM applications. However, ConvLSTM and its variants are computationally expensive, which makes them inapplicable in edge devices with limited computational resources. With the emerging need for edge computing in CPS, efficient AI is essential to reduce the computational cost while preserving the model performance. Common methods of efficient AI are developed to reduce redundancy in model capacity (i.e., model pruning, compression, etc.). However, spatiotemporal data mining naturally requires extensive model capacity, as the embedded dependencies in spatiotemporal data are complex and hard to capture, which limits the model redundancy. Instead, there is a fairly high level of data and feature redundancy that introduces an unnecessary computational burden, which has been largely overlooked in existing research. Therefore, we developed a novel framework SparseST, that pioneered in exploiting data sparsity to develop an efficient spatiotemporal model. In addition, we explore and approximate the Pareto front between model performance and computational efficiency by designing a multi-objective composite loss function, which provides a practical guide for practitioners to adjust the model according to computational resource constraints and the performance requirements of downstream tasks.
翻译:时空数据挖掘(STDM)在各类复杂物理系统(CPS)中具有广泛的应用,例如交通、制造、医疗等领域。在现有方法中,卷积长短期记忆网络(ConvLSTM)已被证明在不同应用中具有通用性和可扩展性,其多种变体在各类STDM任务中均实现了最先进的性能。然而,ConvLSTM及其变体计算成本高昂,难以在计算资源受限的边缘设备上部署。随着CPS中对边缘计算需求的日益增长,发展高效人工智能技术以在保持模型性能的同时降低计算成本至关重要。现有高效AI方法通常致力于减少模型容量冗余(如模型剪枝、压缩等)。但时空数据挖掘天然需要较大的模型容量,因为时空数据中蕴含的依赖关系复杂且难以捕捉,这限制了模型冗余的压缩空间。相反,数据与特征层面存在相当程度的冗余,这些冗余带来了不必要的计算负担,而现有研究对此关注不足。为此,我们提出了一种新颖的框架SparseST,率先利用数据稀疏性构建高效的时空模型。此外,通过设计多目标复合损失函数,我们探索并逼近了模型性能与计算效率之间的帕累托前沿,为实践者根据计算资源限制与下游任务性能需求调整模型提供了实用指导。