Deep architecture have proven capable of solving many tasks provided a sufficient amount of labeled data. In fact, the amount of available labeled data has become the principal bottleneck in low label settings such as Semi-Supervised Learning. Mixing Data Augmentations do not typically yield new labeled samples, as indiscriminately mixing contents creates between-class samples. In this work, we introduce the SciMix framework that can learn to generator to embed a semantic style code into image backgrounds, we obtain new mixing scheme for data augmentation. We then demonstrate that SciMix yields novel mixed samples that inherit many characteristics from their non-semantic parents. Afterwards, we verify those samples can be used to improve the performance semi-supervised frameworks like Mean Teacher or Fixmatch, and even fully supervised learning on a small labeled dataset.
翻译:深层结构已证明有能力解决许多任务,提供了足够数量的标签数据。事实上,现有标签数据的数量已成为低标签设置中的主要瓶颈,比如半超学习。混合数据增加通常不会产生新的标签样本,因为不加区分地混合内容会在类别间产生样本。在这项工作中,我们引入了SciMix框架,可以让生成者学会将语义风格代码嵌入图像背景,我们获得了新的数据增强混合计划。然后我们证明SciMix生成了新颖的混合样本,这些样本继承了非语义性父母的许多特征。随后,我们核实这些样本可以用来改进半监督框架,如Meam教师或Fixmatch,甚至可以完全监督地学习一个小型标签数据集。