Deep generative models (DGMs) are data-hungry because learning a complex model on limited data suffers from large variance and easily overfits. Inspired by the classical perspective of the bias-variance tradeoff, we propose the regularized deep generative model (Reg-DGM), which leverages a nontransferable pre-trained model to reduce the variance of generative modeling with limited data. Formally, Reg-DGM optimizes a weighted sum of a certain divergence and the expectation of an energy function, where the divergence is between the data and the model distributions, and the energy function is defined by the pre-trained model w.r.t. the model distribution. We analyze a simple yet representative Gaussian-fitting case to demonstrate how the weighting hyperparameter trades off the bias and the variance. Theoretically, we characterize the existence and uniqueness of the global minimum of Reg-DGM in a non-parametric setting and prove its convergence with neural networks trained by gradient-based methods. Empirically, with various pre-trained feature extractors and a data-dependent energy function, Reg-DGM consistently improves the generation performance of strong DGMs with limited data and achieves results competitive with state-of-the-art methods.
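As a minimal sketch of the objective described above, the weighted sum can be written in assumed notation (the divergence D, the model distribution p_theta, the energy function E_f defined by the pre-trained model f, and the weight lambda are illustrative symbols, not necessarily the paper's exact notation):

% Sketch of the Reg-DGM objective as described in the abstract.
% Assumed notation: \mathcal{D} is the chosen divergence, p_{data} the data
% distribution, p_{\theta} the model distribution, E_f an energy function
% defined by the pre-trained model f, and \lambda >= 0 the weighting
% hyperparameter that trades off bias and variance.
\begin{equation*}
  \min_{\theta} \; \mathcal{D}\bigl(p_{\mathrm{data}},\, p_{\theta}\bigr)
  \;+\; \lambda \, \mathbb{E}_{x \sim p_{\theta}}\bigl[ E_{f}(x) \bigr]
\end{equation*}
% Larger \lambda pulls the model toward the pre-trained regularizer (lower
% variance, possibly higher bias); \lambda = 0 recovers the base DGM.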