We demonstrate a proof-of-concept of a large language model conducting corporate lobbying related activities. An autoregressive large language model (OpenAI's text-davinci-003) determines if proposed U.S. Congressional bills are relevant to specific public companies and provides explanations and confidence levels. For the bills the model deems as relevant, the model drafts a letter to the sponsor of the bill in an attempt to persuade the congressperson to make changes to the proposed legislation. We use hundreds of novel ground-truth labels of the relevance of a bill to a company to benchmark the performance of the model, which outperforms the baseline of predicting the most common outcome of irrelevance. We also benchmark the performance of the previous OpenAI GPT-3 model (text-davinci-002), which was the state-of-the-art model on many academic natural language tasks until text-davinci-003 was recently released. The performance of text-davinci-002 is worse than simply always predicting that a bill is irrelevant to a company. These results suggest that, as large language models continue to exhibit improved core natural language understanding capabilities, performance on corporate lobbying related tasks will continue to improve. If AI begins to influence law in a manner that is not a direct extension of human intentions, this threatens the critical role that law as information could play in aligning AI with humans. This paper explores how this is increasingly a possibility. Initially, AI is being used to simply augment human lobbyists. However, there may be a slow creep of less and less human oversight over automated assessments of policy ideas and the written communication to regulatory agencies and Congressional staffers. The core question raised is where to draw the line between human-driven and AI-driven policy influence.
翻译:自动递减的大型语言模式(OpenAI's text-davinci-003)决定美国国会法案是否与特定公共公司相关,并提供解释和信心水平。对于该法案,模型认为相关,模型向法案发起人起草一封信,试图说服国会议员修改拟议的立法。我们使用数百个新的地面真相标签,说明法案与公司基准监督模式业绩的相关性,该模式超过了预测最常见的不相干结果的基准。我们还将先前的OpenAI GPT-3(text-davinci-002)模式(text-davinci-002)的绩效作为基准,该模式是许多学术自然语言任务的最新模式,直到最近发布文本-davinci-003。文本-davinici-002的绩效比简单地预测法案与公司无关。这些结果继续表明,随着大型语言模型继续展示日益普遍地预测的货币政策愿景,不断提高的货币政策效果, 也就是在公司内部法律中,这一核心的动作将提升,而公司法的提升。