Writing is a complex process at the center of much of modern human activity. Although it appears to be linear, writing conceals many highly non-linear processes. Previous research has focused on three phases of writing: planning, translation and transcription, and revision. While research has shown that these phases are non-linear, they are often treated as linear when measured. Here, we introduce measures to detect and quantify subcycles of planning (exploration) and translation (exploitation) during the writing process. We apply these measures to a novel dataset that records the creation of a text in all its phases, from early attempts to the finishing touches on a final version. The dataset comes from a series of writing workshops in which innovative versioning software allowed us to record every step in the construction of a text. More than 60 junior researchers in science wrote a scientific essay intended for a general readership. We recorded each essay as a writing cloud, defined as a complex topological structure capturing the history of the essay itself. Using this unique dataset of writing clouds, we present a representation of the writing process that quantifies its complexity and the writer's effort throughout the draft and through time. This representation highlights phases of "translation flow", where authors improve existing ideas, and phases of exploration, where creative deviations appear as the writer returns to the planning phase. The turning points between translation and exploration become rarer as the writing process progresses and the author approaches the final version. Our results and the new measures have the potential to foster discussion about the non-linear nature of writing and to support the development of tools that enable more creative and impactful writing processes.