Rapid advances in sensor, wireless communication, cloud computing and data science have brought unprecedented amount of data to assist transportation engineers and researchers in making better decisions. However, traffic data in reality often has corrupted or incomplete values due to detector and communication malfunctions. Data imputation is thus required to ensure the effectiveness of downstream data-driven applications. To this end, numerous tensor-based methods treating the imputation problem as the low-rank tensor completion (LRTC) have been attempted in previous works. To tackle rank minimization, which is at the core of the LRTC, most of aforementioned methods utilize the tensor nuclear norm (NN) as a convex surrogate for the minimization. However, the over-relaxation issue in NN refrains it from desirable performance in practice. In this paper, we define an innovative nonconvex truncated Schatten p-norm for tensors (TSpN) to approximate tensor rank and impute missing spatiotemporal traffic data under the LRTC framework. We model traffic data into a third-order tensor structure of (time intervals,locations (sensors),days) and introduce four complicated missing patterns, including random missing and three fiber-like missing cases according to the tensor mode-n fibers. Despite nonconvexity of the objective function in our model, we derive the global optimal solutions by integrating the alternating direction method of multipliers (ADMM) with generalized soft-thresholding (GST). In addition, we design a truncation rate decay strategy to deal with varying missing rate scenarios. Comprehensive experiments are finally conducted using real-world spatiotemporal datasets, which demonstrate that the proposed LRTC-TSpN method performs well under various missing cases, meanwhile outperforming other SOTA tensor-based imputation models in almost all scenarios.
翻译:传感器、无线通信、云计算和数据科学方面的快速进步带来了前所未有的数据数量,以协助运输工程师和研究人员做出更好的决定。然而,由于检测和通信故障,现实中的交通数据往往由于检测和通信故障而导致价值的腐蚀或不完整。因此,需要数据估算来确保下游数据驱动应用程序的有效性。为此,在以往的工程中尝试了许多基于压力的方法,将估算问题作为低压高压完成(TSpN)处理。在LRTC框架下,处理软软性最小化(这是LRTC的核心)问题,上述方法中的大多数都使用高压核规范(NNN)作为最小化的螺旋式代谢。然而,NNNN的过度宽松问题使得它无法在实际中取得理想的性表现。在本文中,我们定义了一种创新的不convex调高的Schatten p-normum(Ts),在LRTCFC框架下,我们将交通数据添加到第三种级的螺旋式变速结构结构结构结构(时间间隔、定位)中,运行最短的Sral-laforal-lax Stal-deal-deal-de dal-dede datede dated dated dated dated dated dated dated dakedddd dateddaldalddddddddddddddddddddddddalddd disaldaldddddddddddddddaldddddddddddddddalddalddddddddddddddalddddddd, dias, diad, diadaldaldddddddddddddddddddalddddddddaldaldaldaldaldaldaldddddddddaldddddddddddddddddddddalddddddddaldaldaldddddddddd,我们算,我们以