Model efficiency is a critical aspect of developing and deploying machine learning models. Inference time and latency directly affect the user experience, and some applications have hard requirements. In addition to inference costs, model training also have direct financial and environmental impacts. Although there are numerous well-established metrics (cost indicators) for measuring model efficiency, researchers and practitioners often assume that these metrics are correlated with each other and report only few of them. In this paper, we thoroughly discuss common cost indicators, their advantages and disadvantages, and how they can contradict each other. We demonstrate how incomplete reporting of cost indicators can lead to partial conclusions and a blurred or incomplete picture of the practical considerations of different models. We further present suggestions to improve reporting of efficiency metrics.
翻译:模型效率是开发和部署机器学习模型的一个重要方面。推断时间和时间间隔直接影响到用户的经验,有些应用有困难的要求。除了推断费用外,模型培训还具有直接的金融和环境影响。虽然衡量模型效率的既定衡量标准(成本指标)很多,但研究人员和从业人员往往认为这些衡量标准彼此相关,只报告其中的很少。在本文件中,我们透彻地讨论了共同费用指标、其优缺点以及它们如何相互矛盾。我们表明,关于成本指标的报告不完整,能够如何得出部分结论,并模糊或不完整地说明不同模型的实际考虑。我们还提出了改进效率指标报告的建议。