Potential societal and environmental effects such as the rapidly increasing resource use and the associated environmental impact, reproducibility issues, and exclusivity, the privatization of ML research leading to a public research brain-drain, a narrowing of the research effort caused by a focus on deep learning, and the introduction of biases through a lack of sociodemographic diversity in data and personnel caused by recent developments in machine learning are a current topic of discussion and scientific publications. However, these discussions and publications focus mainly on computer science-adjacent fields, including computer vision and natural language processing or basic ML research. Using bibliometric analysis of the complete and full-text analysis of the open-access literature, we show that the same observations can be made for applied machine learning in chemistry and biology. These developments can potentially affect basic and applied research, such as drug discovery and development, beyond the known issue of biased data sets.
翻译:这些讨论和出版物主要集中于计算机科学和相邻领域,包括计算机视觉和自然语言处理或基本ML研究。 我们利用对开放检索文献的完整和全文分析的比理学分析,表明对化学和生物学应用机器的学习可以提出同样的观察意见,这些发展动态可能会影响基本和应用研究,例如药物的发现和发展,超越已知的有偏见的数据集问题。