以共同语言进行目标导向的语义交流学习课程 (Curriculum Learning for Goal-Oriented Semantic Communications with a Common Language)

Goal-oriented semantic communication will be a pillar of next-generation wireless networks. Despite significant recent efforts in this area, most prior works are focused on specific data types (e.g., image or audio), and they ignore the goal and effectiveness aspects of semantic transmissions. In contrast, in this paper, a holistic goal-oriented semantic communication framework is proposed to enable a speaker and a listener to cooperatively execute a set of sequential tasks in a dynamic environment. A common language based on a hierarchical belief set is proposed to enable semantic communications between speaker and listener. The speaker, acting as an observer of the environment, utilizes the beliefs to transmit an initial description of its observation (called event) to the listener. The listener is then able to infer on the transmitted description and complete it by adding related beliefs to the transmitted beliefs of the speaker. As such, the listener reconstructs the observed event based on the completed description, and it then takes appropriate action in the environment based on the reconstructed event. An optimization problem is defined to determine the perfect and abstract description of the events while minimizing the transmission and inference costs with constraints on the task execution time and belief efficiency. Then, a novel bottom-up curriculum learning (CL) framework based on reinforcement learning is proposed to solve the optimization problem and enable the speaker and listener to gradually identify the structure of the belief set and the perfect and abstract description of the events. Simulation results show that the proposed CL method outperforms traditional RL in terms of convergence time, task execution cost and time, reliability, and belief efficiency.

翻译：以目标为导向的语义通信将是下一代无线网络的一个支柱。尽管最近在这一领域做出了大量努力,但大多数先前的作品都侧重于特定的数据类型(如图像或音频),忽视了语义传输的目标和有效性方面。与此形成对照的是,本文件提出了一个面向目标的语义通信整体框架,以使发言者和听众能够在动态环境中合作执行一系列相继任务。提出了基于等级观念集的共同语言,以便能够在演讲者和听众之间进行语义通信。发言者作为环境观察员,利用这些信念向听众传递关于其观测(所谓事件)的初步描述。然后,听众能够通过将相关信念添加到演讲者所传递的信仰中来推断出所传递的语义和有效性。因此,听众根据完整描述对所观察到的事件进行重新整理,然后根据重建事件在环境中采取适当行动。优化问题被确定事件的完整性和抽象描述,同时尽量减少其观察(所谓事件)的“事件的”内容的“内容的“内容”初步描述和“内容”的缩略性描述,然后根据任务、时间框架的缩略度和“时间定义”的缩略度解释,根据任务执行方式和“时间定义,确定一个基于任务执行的“格式”的“时间定义和“格式”的“格式”定义,使任务定义,使演讲的准确理解和“理解”的“理解和“理解”的“格式”定义,使任务”的准确性任务”的“理解和“时间定义,使任务”的“格式能够使任务”的“理解”的“理解”的“理解”的“理解和“格式”的“格式能够使任务”的“格式”的“格式”的“、”能够使任务”的“理解和“格式”的“理解”的“、”的“、”的“理解”的“、“根据”的“、”的“理解和“理解”的“、”的进度”的“理解”的“格式能”的进度”的“理解和“的拟议理解和“理解和“的拟议理解和“格式”的进度”的进度”的进度”的进度”的“、”的“、”的进度”的进度”的进度”的进度”的进度”的“理解和“为”的进度”的进度”的难度和“为“为”的“为”的“理解和“理解和“理解”定义,使”的