In edge inference, an edge server provides remote-inference services to edge devices. This requires the edge devices to upload high-dimensional features of data samples over resource-constrained wireless channels, which creates a communication bottleneck. The conventional solution of feature pruning requires that the device has access to the inference model, which is unavailable in the current scenario of split inference. To address this issue, we propose the progressive feature transmission (ProgressFTX) protocol, which minimizes the overhead by progressively transmitting features until a target confidence level is reached. The optimal control policy of the protocol to accelerate inference is derived and it comprises two key operations. The first is importance-aware feature selection at the server, for which it is shown to be optimal to select the most important features, characterized by the largest discriminant gains of the corresponding feature dimensions. The second is transmission-termination control by the server for which the optimal policy is shown to exhibit a threshold structure. Specifically, the transmission is stopped when the incremental uncertainty reduction by further feature transmission is outweighed by its communication cost. The indices of the selected features and transmission decision are fed back to the device in each slot. The optimal policy is first derived for the tractable case of linear classification and then extended to the more complex case of classification using a convolutional neural network. Both Gaussian and fading channels are considered. Experimental results are obtained for both a statistical data model and a real dataset. It is seen that ProgressFTX can substantially reduce the communication latency compared to conventional feature pruning and random feature transmission.
翻译:在边缘推断中, 边缘服务器为边缘设备提供远程推断服务 。 这要求边缘设备在资源限制的无线频道上传数据样本的高维特征, 从而造成通信瓶颈。 常规功能调整解决方案要求设备能够访问当前分解推理假设中无法使用的推断模型。 为了解决这个问题, 我们提议了渐进特征传输( 进步FTX) 协议, 通过在达到目标信任度之前逐步传输功能, 最大限度地减少管理管理管理。 协议加速推断的最佳控制政策由两个关键操作组成。 第一个是服务器的重要觉悟特征选择, 其特点是选择最重要的特征, 其特点是以相应的特性大小的最大差异性增益为特征。 第二个是服务器的传输- 传输控制, 其最佳政策显示的显示为一个阈值结构。 具体地, 当通过进一步的特性传输来减少递增的不确定性时, 其最优的控制政策政策化政策是产生, 所选择的特性和传输决定的指数在服务器上显示的最优性能, 将数据递增的递归为每个数据序列 。 最优的递增的顺序分析结果, 和最精确的递增的递增的递归的序列 。