In most scientific domains, the deep learning community has largely focused on the quality of deep generative models, resulting in highly accurate and successful solutions. However, this race for quality comes at a tremendous computational cost, which incurs vast energy consumption and greenhouse gas emissions. At the heart of this problem are the measures that we use as a scientific community to evaluate our work. In this paper, we suggest relying on a multi-objective measure based on Pareto optimality, which takes into account both the quality of the model and its energy consumption. By applying our measure on the current state-of-the-art in generative audio models, we show that it can drastically change the significance of the results. We believe that this type of metric can be widely used by the community to evaluate their work, while putting computational cost -- and in fine energy consumption -- in the spotlight of deep learning research.
翻译:在大多数科学领域,深层次的学习界主要侧重于深层基因模型的质量,从而产生高度准确和成功的解决方案。然而,这种质量竞赛的计算成本巨大,导致大量能源消耗和温室气体排放。这个问题的核心是我们作为一个科学界用来评价我们的工作的措施。在本文中,我们建议依靠基于Pareto最佳性的多目标措施,该措施既考虑到模型的质量,又考虑到其能源消耗。我们通过对目前基因化音频模型中的最新水平进行测量,表明它能够极大地改变结果的意义。我们认为,这种类型的计量方法可以被社区广泛用来评价其工作,同时将计算成本 -- -- 和精美的能源消耗 -- -- 放在深思熟虑的研究的焦点。