Large-scale language models are rapidly improving, performing well on a wide variety of tasks with little to no customization. In this work we investigate how language models can support science writing, a challenging writing task that is both open-ended and highly constrained. We present a system for generating "sparks", sentences related to a scientific concept intended to inspire writers. We find that our sparks are more coherent and diverse than a competitive language model baseline, and approach a human-created gold standard. In a study with 13 PhD students writing on topics of their own selection, we find three main use cases of sparks: aiding with crafting detailed sentences, providing interesting angles to engage readers, and demonstrating common reader perspectives. We also report on the various reasons sparks were considered unhelpful, and discuss how we might improve language models as writing support tools.
翻译:大型语言模式正在迅速改善,在各种任务上表现良好,很少甚至没有定制。在这项工作中,我们调查语言模式如何支持科学写作,这是一项具有挑战性的写作任务,既开放又受高度限制。我们提出了一个产生“园艺”的系统,其句子与旨在激励作家的科学概念相关。我们发现,我们的火花比竞争性语言模式基线更加一致和多样化,并采用人造黄金标准。在与13名博士生就自己选择的专题撰写论文的研究中,我们发现了三种主要使用火花的例子:协助撰写详细句子,提供吸引读者的有趣角度,并展示共同读者的观点。我们还报告了各种原因,认为火花没有帮助,并讨论了我们如何改进语言模式作为写作支持工具。