Recently, semantic communication has been widely applied in wireless image transmission systems as it can prioritize the preservation of meaningful semantic information in images over the accuracy of transmitted symbols, leading to improved communication efficiency. However, existing semantic communication approaches still face limitations in achieving considerable inference performance in downstream AI tasks like image recognition, or balancing the inference performance with the quality of the reconstructed image at the receiver. Therefore, this paper proposes a contrastive learning (CL)-based semantic communication approach to overcome these limitations. Specifically, we regard the image corruption during transmission as a form of data augmentation in CL and leverage CL to reduce the semantic distance between the original and the corrupted reconstruction while maintaining the semantic distance among irrelevant images for better discrimination in downstream tasks. Moreover, we design a two-stage training procedure and the corresponding loss functions for jointly optimizing the semantic encoder and decoder to achieve a good trade-off between the performance of image recognition in the downstream task and reconstructed quality. Simulations are finally conducted to demonstrate the superiority of the proposed method over the competitive approaches. In particular, the proposed method can achieve up to 56\% accuracy gain on the CIFAR10 dataset when the bandwidth compression ratio is 1/48.
翻译:近来,语义通信在无线图像传输系统上的应用越来越广泛,因为它能够在图像的语义信息的保存和传输符号的准确性之间做出权衡,从而提高通信效率。然而,现有的语义通信方法仍然在达到下游人工智能任务(如图像识别)的推理性能或在保持接收端重建图像质量的同时平衡推理性能方面存在局限性。因此,本文提出了一种基于对比学习(CL)的语义通信方法来克服这些限制。具体而言,我们将传输过程中的图像失真视为CL中的一种数据增强形式,并利用CL来降低原始图像与失真重建之间的语义距离,同时保持无关图像之间的语义距离,以更好地进行下游任务的区分。此外,我们设计了一个两阶段的训练过程以及相应的损失函数来联合优化语义编码器和译码器,以实现在下游任务中具有良好的推理性能和重建质量的平衡。最后进行了仿真实验,证明了所提出方法的优越性。特别是,在带宽压缩比为1/48时,所提出方法在CIFAR10数据集上能够获得高达56%的准确率提升。