Distributed tensor decomposition (DTD) is a fundamental data-analytics technique that extracts latent important properties from high-dimensional multi-attribute datasets distributed over edge devices. Conventionally its wireless implementation follows a one-shot approach that first computes local results at devices using local data and then aggregates them to a server with communication-efficient techniques such as over-the-air computation (AirComp) for global computation. Such implementation is confronted with the issues of limited storage-and-computation capacities and link interruption, which motivates us to propose a framework of on-the-fly communication-and-computing (FlyCom$^2$) in this work. The proposed framework enables streaming computation with low complexity by leveraging a random sketching technique and achieves progressive global aggregation through the integration of progressive uploading and multiple-input-multiple-output (MIMO) AirComp. To develop FlyCom$^2$, an on-the-fly sub-space estimator is designed to take real-time sketches accumulated at the server to generate online estimates for the decomposition. Its performance is evaluated by deriving both deterministic and probabilistic error bounds using the perturbation theory and concentration of measure. Both results reveal that the decomposition error is inversely proportional to the population of sketching observations received by the server. To further rein in the noise effect on the error, we propose a threshold-based scheme to select a subset of sufficiently reliable received sketches for DTD at the server. Experimental results validate the performance gain of the proposed selection algorithm and show that compared to its one-shot counterparts, the proposed FlyCom$^2$ achieves comparable (even better in the case of large eigen-gaps) decomposition accuracy besides dramatically reducing devices' complexity costs.
翻译:分布式电磁分解( DTD) 是一种基本的数据分析技术, 它从在边缘设备上分布的高维多属性数据集中提取潜在重要属性。 常规地, 它的无线实施采用一次性方法, 首先利用本地数据在设备上计算本地结果, 然后用通信效率高的技术, 如用于全球计算的超空计算( AirComp) 将其汇总到服务器。 这样的实施遇到了存储和计算能力有限以及链接中断的问题, 这促使我们在此工作中提出一个在高维多属性分布的多属性数据集中潜在重要属性的框架( FlyCompresident liveralalalalalal- liderational- setrial lider- setective (FlyComplement) 。 拟议的框架能够利用随机草图技术来进行低复杂性的流计算, 并通过渐进式上传和多次投影输出( IMIMO) 等整合, 开发FlyCommissional $2$, 一个基于运行的子空间预算算器在服务器上进行实时累积, 以提出在选择性观测结果, 并用直观测结果显示结果显示。 度显示一个稳定性变压结果, 度显示, 度的运行结果, 通过评估, 显示, 正在显示, 显示一个稳定的平整算结果显示, 进行进一步的算算。</s>