This paper examines the maximum code rate achievable by a data-driven communication system over some unknown discrete memoryless channel in the finite blocklength regime. A class of channel codes, called learning-based channel codes, is first introduced. Learning-based channel codes include a learning algorithm to transform the training data into a pair of encoding and decoding functions that satisfy some statistical reliability constraint. Data-dependent achievability and converse bounds in the non-asymptotic regime are established for this class of channel codes. It is shown analytically that the asymptotic expansion of the bounds for the maximum achievable code rate of the learning-based channel codes are tight for sufficiently large training data.
翻译:本文研究了数据驱动通信系统在某些未知离散无记忆通道上的最大编码速率在有限块长区间内的可实现性。首先引入了一种称为学习型通道编码的编码类别。学习型通道编码包括一个学习算法,用于将训练数据转化为一对编码和解码函数,以满足某些统计可靠性约束。本文为这种通道编码类别建立了非渐近区间内的数据相关可实现性和反可实现性界限。分析表明,在有足够大的训练数据时,学习型通道编码的最大可实现编码速率的极限扩展边界是紧的。