The ability to generate synthetic sequences is crucial for a wide range of applications, and recent advances in deep learning architectures and generative frameworks have greatly facilitated this process. Particularly, unconditional one-shot generative models constitute an attractive line of research that focuses on capturing the internal information of a single image or video to generate samples with similar contents. Since many of those one-shot models are shifting toward efficient non-deep and non-adversarial approaches, we examine the versatility of a one-shot generative model for augmenting whole datasets. In this work, we focus on how similarity at the subsequence level affects similarity at the sequence level, and derive bounds on the optimal transport of real and generated sequences based on that of corresponding subsequences. We use a one-shot generative model to sample from the vicinity of individual sequences and generate subsequence-similar ones and demonstrate the improvement of this approach by applying it to the problem of Unmanned Aerial Vehicle (UAV) identification using limited radio-frequency (RF) signals. In the context of UAV identification, RF fingerprinting is an effective method for distinguishing legitimate devices from malicious ones, but heterogenous environments and channel impairments can impose data scarcity and affect the performance of classification models. By using subsequence similarity to augment sequences of RF data with a low ratio (5%-20%) of training dataset, we achieve significant improvements in performance metrics such as accuracy, precision, recall, and F1 score.
翻译:生成合成序列的能力对于广泛的应用至关重要,并且最近深度学习结构和生成框架的进步大大促进了此过程。尤其是,无条件的一次性生成模型构成了一个有吸引力的研究方向,重点在于捕获一张图像或视频的内部信息,以生成具有类似内容的样本。由于许多这些一次性模型正在转向高效的非深度学习和非对抗性方法,因此我们检查了一次性生成模型增强整个数据集的多功能性。在这项工作中,我们关注子序列级别的相似性如何影响序列级别的相似性,并根据相应子序列的最优传输推导出实际和生成序列之间最优传输的界限。我们使用一次性生成模型从单个序列的周围进行采样并生成类似子序列的序列,并通过将其应用于使用有限射频(RF)信号的无人机识别问题来证明此方法的改进。在无人机识别的背景下,RF指纹识别是区分合法设备和恶意设备的有效方法,但异构环境和信道失真可以导致数据稀缺并影响分类模型的性能。通过使用子序列相似性来增加具有低训练数据集比率(5%-20%)的RF数据序列,我们获得了显着的性能指标,例如准确性,精度,召回率和F1分数的改进。