Recently proposed consistency-based Semi-Supervised Learning (SSL) methods such as the $\Pi$-model, temporal ensembling, the mean teacher, or the virtual adversarial training, have advanced the state of the art in several SSL tasks. These methods can typically reach performances that are comparable to their fully supervised counterparts while using only a fraction of labelled examples. Despite these methodological advances, the understanding of these methods is still relatively limited. In this text, we analyse (variations of) the $\Pi$-model in settings where analytically tractable results can be obtained. We establish links with Manifold Tangent Classifiers and demonstrate that the quality of the perturbations is key to obtaining reasonable SSL performances. Importantly, we propose a simple extension of the Hidden Manifold Model that naturally incorporates data-augmentation schemes and offers a framework for understanding and experimenting with SSL methods.
翻译:最近提出的基于一致性的半强化学习方法,如$Pi美元模型、时间组合、平均教师或虚拟对抗性培训等,在几个SSL任务中提高了最新水平,这些方法通常可以达到与其完全监督的对应人员相当的效绩,但只使用一部分贴有标签的例子。尽管在方法上取得了这些进展,对这些方法的理解仍然相对有限。在本案文中,我们分析(变换)在可以获得可分析可取结果的环境下的$\Pi美元模型。我们与Manifold Tangent分类者建立了联系,并表明扰动质量是获得合理的SSL业绩的关键。重要的是,我们建议简单扩展隐藏式模型,自然纳入数据放大计划,并为了解和试验SSL方法提供一个框架。