The problem of end-to-end learning of a communication system using an autoencoder -- consisting of an encoder, channel, and decoder modeled using neural networks -- has recently been shown to be an effective approach. A challenge faced in the practical adoption of this learning approach is that under changing channel conditions (e.g. a wireless link), it requires frequent retraining of the autoencoder in order to maintain a low decoding error rate. Since retraining is both time consuming and requires a large number of samples, it becomes impractical when the channel distribution is changing quickly. We propose to address this problem using a fast and sample-efficient (few-shot) domain adaptation method that does not change the encoder and decoder networks. Different from conventional training-time unsupervised or semi-supervised domain adaptation, here we have a trained autoencoder from a source distribution that we want to adapt (at test time) to a target distribution using only a small labeled dataset, and no unlabeled data. We focus on a generative channel model based on the Gaussian mixture density network (MDN), and propose a regularized, parameter-efficient adaptation of the MDN using a set of affine transformations. The learned affine transformations are then used to design an optimal transformation at the decoder input to compensate for the distribution shift, and effectively present to the decoder inputs close to the source distribution. Experiments on many simulated distribution changes common to the wireless setting, and a real mmWave FPGA testbed demonstrate the effectiveness of our method at adaptation using very few target domain samples. The code for our work can be found at: https://github.com/jayaram-r/domain-adaptation-autoencoder.
翻译:使用自动编码器(由使用神经网络模型的编码器、频道和解码器模型组成)的通信系统的端到端学习问题最近被证明是一种有效的方法。在实际采用这种学习方法时面临的挑战是,在改变频道条件(例如无线链接)下,它需要经常对自动编码器进行再培训,以保持低解码错误率。由于再培训既耗时又需要大量样本,当频道分配迅速变化时,它变得不切实际。我们提议使用快速和抽样高效(few-shot)域效率调整方法解决这一问题,但不会改变编码器和解码网络。与传统的未经监督或半超过域调整的培训时间不同,我们这里有一个经过训练的自动编码器,从我们想要(测试时间)调整到目标分配,仅使用一个小型的标签数据集,而没有未加标签的数据源。我们把一个基于Caghsal-readdal-dection 目标转换模型放在了我们的内部混合混合物和近度变异调网络上(MDMD),用一个常规的测试方法来调整。</s>