Optimal viewpoint prediction is an essential task in many computer graphics applications. Unfortunately, common viewpoint qualities suffer from two major drawbacks: dependency on clean surface meshes, which are not always available, and the lack of closed-form expressions, which requires a costly search involving rendering. To overcome these limitations we propose to separate viewpoint selection from rendering through an end-to-end learning approach, whereby we reduce the influence of the mesh quality by predicting viewpoints from unstructured point clouds instead of polygonal meshes. While this makes our approach insensitive to the mesh discretization during evaluation, it only becomes possible when resolving label ambiguities that arise in this context. Therefore, we additionally propose to incorporate the label generation into the training procedure, making the label decision adaptive to the current network predictions. We show how our proposed approach allows for learning viewpoint predictions for models from different object categories and for different viewpoint qualities. Additionally, we show that prediction times are reduced from several minutes to a fraction of a second, as compared to state-of-the-art (SOTA) viewpoint quality evaluation. We will further release the code and training data, which will to our knowledge be the biggest viewpoint quality dataset available.
翻译:最佳观点预测是许多计算机图形应用中的一项基本任务。 不幸的是,共同观点质量存在两大缺陷:依赖不总是可用的清洁表面胶片,缺乏封闭式表达方式,这需要花费大量时间的搜索。为了克服这些局限性,我们建议通过端到端学习方法,将观点选择与观点选择分开,从而通过预测非结构点云而不是多边形网外线云的观点来减少网外质量的影响。这使我们的方法在评价期间对网状分解不敏感,但只有在解决这一背景下出现的标签模糊性时才有可能。因此,我们还建议将标签生成纳入培训程序,使标签决定适应当前的网络预测。我们展示我们拟议的方法如何允许学习不同对象类别和不同观点质量模型的观点预测。此外,我们显示预测时间从几分钟减少到了第二点的一小部分,与最新技术(SOTA)的高质量评估相比,我们将进一步发布代码和培训数据,这将使我们的知识质量观点成为最高观点。