Compressed semantic representation of source is essentially important to accomplish various artificial intelligent (AI) tasks in task-oriented semantic communication (TOSC). In this paper, by extending the rate-distortion theory to multiple tasks, we propose a TOSC scheme with semantic reconstruction (SR), named as TOSC-SR, in the joint source and channel coding (JSCC) framework. Besides extracting and compressing task semantics, our basic idea here is to reconstruct images with task semantics rather than traditionally in the pixels or features. The main purpose is to share the semantic-reconstructed images among multiple tasks with enhanced generalization ability under certain rate. We formulate the TOSC-SR scheme as a rate-distortion optimization problem, where a novel semantic distortion measurement is defined by mutual information of source, the semantic-reconstructed images, and task labels, pairwise. We derive an analytic solution for the formulated problem, where the self-consistent equations are obtained to determine the optimal mapping of source and the semantic-reconstructed images by taking task labels into account. In the TOSC-SR scheme which is feasible in practice, a relaxed version of loss function is derived based on variational approximation of mutual information. Then we adopt the classification task to train TOSC-SR, and the object detection task to validate the generalization ability. Experimental results show that the proposed TOSC-SR scheme outperforms conventional JPEG, JPEG2000 based communication schemes and deep learning based TOSC with general reconstruction schemes in terms of reconstruction quality, classification and object detection performance at the same source compression ratio and signal-to-noise (SNR) regime.
翻译:源的压缩语义表达方式对于在任务导向语义通信中完成各种人工智能(AI)任务十分重要。 在本文中,通过将率扭曲理论扩展至多重任务,我们提出了一个以语义重建(SR)为名称的TOSC计划,在联合源和频道编码(JSCC)框架内,称为TOSC-SR。除了提取和压缩任务语义外,我们在这里的基本想法是用任务语义而不是传统的像素或特征来重建图像。主要目的是在多个任务中共享语义目标再构造图像,在特定速率下,以强化的通化能力共享。我们把TOSC-SR计划设计成一个以语义重建(SRS)为代言义重建(TOSC-S-SR)方案,在源的相互信息、语义再校正的图像和任务分类中,我们用自相兼容的方程式来决定最优化的语义目的再解图像源和精度再解变校程。