Convolutional neural networks (CNNs) have become commonplace in addressing major challenges in computer vision. Researchers are not only coming up with new CNN architectures but are also researching different techniques to improve the performance of existing architectures. However, there is a tendency to over-emphasize performance improvement while neglecting certain important variables such as simplicity, versatility, the fairness of comparisons, and energy efficiency. Overlooking these variables in architectural design and evaluation has led to research bias and a significantly negative environmental impact. Furthermore, this can undermine the positive impact of research in using deep learning models to tackle climate change. Here, we perform an extensive and fair empirical study of a number of proposed techniques to gauge the utility of each technique for segmentation and classification. Our findings restate the importance of favoring simplicity over complexity in model design (Occam's Razor). Furthermore, our results indicate that simple standardized practices can lead to a significant reduction in environmental impact with little drop in performance. We highlight that there is a need to rethink the design and evaluation of CNNs to alleviate the issue of research bias and carbon emissions.
翻译:研究者们不仅正在开发新的CNN架构,而且正在研究各种不同的技术来改进现有架构的性能,然而,有一种过度强调业绩改进而忽视某些重要变数的倾向,例如简单、多功能、比较的公平性和能源效率。在建筑设计和评估中忽略这些变数已导致研究偏差和对环境的显著负面影响。此外,这可能损害研究在利用深层学习模型应对气候变化方面的积极影响。在这里,我们广泛而公正地研究了为衡量每种技术在分化和分类方面的效用而提出的一些技术。我们的调查结果重申,在模型设计中必须简单易行(Occam's Razor)。此外,我们的结果表明,简单的标准化做法可以大大减少对环境的影响,而绩效则很少下降。我们强调需要重新考虑CNN的设计和评价,以缓解研究偏差和碳排放问题。