Revision is an essential part of the human writing process. It tends to be strategic, adaptive, and, more importantly, iterative in nature. Despite the success of large language models on text revision tasks, they are limited to non-iterative, one-shot revisions. Examining and evaluating the capability of large language models for making continuous revisions and collaborating with human writers is a critical step towards building effective writing assistants. In this work, we present a human-in-the-loop iterative text revision system, Read, Revise, Repeat (R3), which aims at achieving high quality text revisions with minimal human efforts by reading model-generated revisions and user feedbacks, revising documents, and repeating human-machine interactions. In R3, a text revision model provides text editing suggestions for human writers, who can accept or reject the suggested edits. The accepted edits are then incorporated into the model for the next iteration of document revision. Writers can therefore revise documents iteratively by interacting with the system and simply accepting/rejecting its suggested edits until the text revision model stops making further revisions or reaches a predefined maximum number of revisions. Empirical experiments show that R3 can generate revisions with comparable acceptance rate to human writers at early revision depths, and the human-machine interaction can get higher quality revisions with fewer iterations and edits. The collected human-model interaction dataset and system code are available at \url{https://github.com/vipulraheja/IteraTeR}. Our system demonstration is available at \url{https://youtu.be/lK08tIpEoaE}.
翻译:修改是人文写作过程的一个基本部分。 它往往具有战略性、适应性,更重要的是,具有迭接性。 尽管在文本修改任务方面大型语言模式取得了成功,但仅限于非文字性、一次性的修订。 审查和评估大语言模式持续修改以及与人类作家合作的能力是建立有效写作助理的关键步骤。 在此工作中, 我们提出一个“ 人文在网上”迭接文本修订系统, 读、 修改、 重复( R3), 目的是通过阅读模型产生的修改和用户反馈, 修改文件, 重复人类机器互动, 实现高质量的文本修订。 在 R3 中, 文本修订模式为人类作家提供文本编辑建议, 他们可以接受或拒绝建议的编辑。 然后, 接受的编辑将纳入下一个文件修改的模型。 因此, 作者可以通过与系统互动, 简单接受/ 修改其建议的编辑, 直至文本修订模式不再进一步修订或达到预定的最大修改数量。 Epricalalalalalal 实验显示, R3 能够与人类系统进行更高质量的修改, 和人类更深入的修改。