Recently years, the attempts on distilling mobile data into useful knowledge has been led to the deployment of machine learning algorithms at the network edge. Principal component analysis (PCA) is a classic technique for extracting the linear structure of a dataset, which is useful for feature extraction and data compression. In this work, we propose the deployment of distributed PCA over a multi-access channel based on the algorithm of stochastic gradient descent to learn the dominant feature space of a distributed dataset at multiple devices. Over-the-air aggregation is adopted to reduce the multi-access latency, giving the name over-the-air PCA. The novelty of this design lies in exploiting channel noise to accelerate the descent in the region around each saddle point encountered by gradient descent, thereby increasing the convergence speed of over-the-air PCA. The idea is materialized by proposing a power-control scheme which detects the type of descent region and controlling the level of channel noise accordingly. The scheme is proved to achieve a faster convergence rate than in the case without power control.
翻译:近年来,将移动数据提炼为有用知识的尝试已经导致在网络边缘部署机器学习算法。主元件分析(PCA)是提取数据集线性结构的经典技术,有助于地貌提取和数据压缩。在这项工作中,我们提议在基于随机梯度梯度下降的算法的多存取频道上部署分布式五氯苯甲醚,以了解多设备分布式数据集的主要特征空间。在空中集合是为了减少多存取性延缓度,给其命名为在空中的五氯苯甲醚。这一设计的新颖之处在于利用频道噪音加速梯度下降所遇到的每个临界点周围区域的下降速度,从而加快大气中五氯苯甲醚的趋同速度。通过提出一种能控系统,检测下层区域的类型并据此控制频道的噪音水平,使这一想法成为现实。该计划证明比没有电源控制的情况更快地实现汇合率。