Large-scale pre-training has recently emerged as a technique for creating capable, general-purpose generative models such as GPT-3, Megatron-Turing NLG, Gopher, and many others. In this paper, we highlight a counterintuitive property of such models and discuss the policy implications of this property. Namely, these generative models have an unusual combination of predictable loss on a broad training distribution (as embodied in their "scaling laws"), and unpredictable specific capabilities, inputs, and outputs. We believe that the high-level predictability and appearance of useful capabilities drive rapid development of such models, while the unpredictable qualities make it difficult to anticipate the consequences of model deployment. We illustrate how this combination can lead to socially harmful behavior, drawing on examples from the literature and real-world observations, and we also perform two novel experiments to illustrate our point about harms from unpredictability. Furthermore, we analyze how these conflicting properties combine to give model developers various motivations for deploying these models, as well as challenges that can hinder deployment. We conclude with a list of possible interventions the AI community could pursue to increase the chance of these models having a beneficial impact. We intend this paper to be useful to policymakers who want to understand and regulate AI systems, technologists who care about the potential policy impact of their work, and academics who want to analyze, critique, and potentially develop large generative models.
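To make the "predictable loss" half of this combination concrete, the sketch below fits a power-law scaling curve of the form L(C) = a * C^(-alpha) + L_inf to loss measurements at several compute budgets and extrapolates to a larger budget. This is a minimal illustration, not the paper's methodology: the compute values, losses, and fitted constants are hypothetical placeholders, and the power-law form is the standard one assumed in the scaling-laws literature.

    # Minimal sketch: fitting a hypothetical scaling law and extrapolating.
    # All data points and constants below are illustrative placeholders.
    import numpy as np
    from scipy.optimize import curve_fit

    compute = np.array([1e15, 1e16, 1e17, 1e18, 1e19])  # hypothetical training FLOPs
    loss = np.array([4.10, 3.45, 2.95, 2.55, 2.22])     # hypothetical eval losses

    def power_law(c, a, alpha, l_inf):
        # Loss falls as a power law in compute, approaching an irreducible floor.
        return a * c ** (-alpha) + l_inf

    params, _ = curve_fit(power_law, compute, loss, p0=[100.0, 0.1, 1.0], maxfev=10000)
    a, alpha, l_inf = params

    # The fitted curve predicts aggregate loss at a 10x larger budget, even though
    # it says nothing about which specific capabilities or outputs will appear.
    print(f"alpha = {alpha:.3f}, predicted loss at 1e20 FLOPs = {power_law(1e20, *params):.2f}")

The point of the sketch is the asymmetry the abstract describes: a three-parameter curve fit suffices to forecast the aggregate loss of a much larger model, while no comparably simple procedure forecasts its specific behaviors.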