Designing and generating new data under targeted properties has been attracting various critical applications such as molecule design, image editing and speech synthesis. Traditional hand-crafted approaches heavily rely on expertise experience and intensive human efforts, yet still suffer from the insufficiency of scientific knowledge and low throughput to support effective and efficient data generation. Recently, the advancement of deep learning induces expressive methods that can learn the underlying representation and properties of data. Such capability provides new opportunities in figuring out the mutual relationship between the structural patterns and functional properties of the data and leveraging such relationship to generate structural data given the desired properties. This article provides a systematic review of this promising research area, commonly known as controllable deep data generation. Firstly, the potential challenges are raised and preliminaries are provided. Then the controllable deep data generation is formally defined, a taxonomy on various techniques is proposed and the evaluation metrics in this specific domain are summarized. After that, exciting applications of controllable deep data generation are introduced and existing works are experimentally analyzed and compared. Finally, the promising future directions of controllable deep data generation are highlighted and five potential challenges are identified.
翻译:在目标特性下设计和生成新数据,吸引了分子设计、图像编辑和语音合成等各种关键应用,传统手工制作方法在很大程度上依赖专业知识经验和人类密集努力,但仍因科学知识不足和低输送量不足而受到影响,以支持有效和高效的数据生成;最近,深层次学习的推进引出了能够了解数据基本代表性和特性的直观方法;这种能力为了解数据的结构模式和功能特性之间的相互关系和利用这种关系产生结构性数据提供了新的机会;本篇文章系统地审查了这一前景良好的研究领域,通常称为可控深海数据生成;首先,提出了潜在的挑战,并提供了初步数据;随后,正式确定了可控深度数据生成,提出了各种技术分类法,并总结了这一具体领域的评价指标;之后,引入了可控深层数据生成的令人振奋奋人心的应用,对现有工作进行了实验性分析和比较;最后,突出了可控深海数据生成的前景,并确定了五项潜在挑战。