Inferring programs which generate 2D and 3D shapes is important for reverse engineering, enabling shape editing, and more. Supervised learning is hard to apply to this problem, as paired (program, shape) data rarely exists. Recent approaches use supervised pre-training with randomly-generated programs and then refine using self-supervised learning. But self-supervised learning either requires that the program execution process be differentiable or relies on reinforcement learning, which is unstable and slow to converge. In this paper, we present a new approach for learning to infer shape programs, which we call latent execution self training (LEST). As with recent prior work, LEST starts by training on randomly-generated (program, shape) pairs. As its name implies, it is based on the idea of self-training: running a model on unlabeled input shapes, treating the predicted programs as ground truth latent labels, and training again. Self-training is known to be susceptible to local minima. LEST circumvents this problem by leveraging the fact that predicted latent programs are executable: for a given shape $\mathbf{x}^* \in S^*$ and its predicted program $\mathbf{z} \in P$, we execute $\mathbf{z}$ to obtain a shape $\mathbf{x} \in S$ and train on $(\mathbf{z} \in P, \mathbf{x} \in S)$ pairs, rather than $(\mathbf{z} \in P, \mathbf{x}^* \in S^*)$ pairs. Experiments show that the distribution of executed shapes $S$ converges toward the distribution of real shapes $S^*$. We establish connections between LEST and algorithms for learning generative models, including variational Bayes, wake sleep, and expectation maximization. For constructive solid geometry and assembly-based modeling, LEST's inferred programs converge to high reconstruction accuracy significantly faster than those of reinforcement learning.
翻译:生成 2D 和 3D 形状的导出程序对于反向工程、 使形状编辑更强 。 受监督的学习很难适用于这一问题, 因为配对( 程序、 形状) 数据很少存在。 最近的方法使用随机生成的程序来监督预培训, 然后使用自监督的学习来改进。 但自我监督的学习要么要求程序执行过程具有差异性, 要么依靠强化学习, 而这种学习是不稳定和缓慢的。 在本文中, 我们提出一种新的方法来学习如何构造程序, 我们称之为潜在执行自我培训( LEST ) 。 正如最近的工作一样, LEST 以随机生成的( 程序、 形状、 形状、 程序、 随机生成的模型, 将预测的程序作为地面真相潜伏标签处理, 并且再次培训。 已知的是, 自我培训对本地的 和 美元正值 。 LEST 将回避这一问题, 利用预测的潜在程序是可执行的 $( 美元)、 Smath\\\\\ max 的元) 的变现成, Smax 。