The neural boom that has sparked natural language processing (NLP) research through the last decade has similarly led to significant innovations in data-to-text generation (DTG). This survey offers a consolidated view into the neural DTG paradigm with a structured examination of the approaches, benchmark datasets, and evaluation protocols. This survey draws boundaries separating DTG from the rest of the natural language generation (NLG) landscape, encompassing an up-to-date synthesis of the literature, and highlighting the stages of technological adoption from within and outside the greater NLG umbrella. With this holistic view, we highlight promising avenues for DTG research that not only focus on the design of linguistically capable systems but also systems that exhibit fairness and accountability.
翻译:在过去十年中,自然语言处理(NLP)研究的神经繁荣同样导致数据到文字生成(DTG)的重大创新。这项调查对DTG的神经范式提供了综合观点,对方法、基准数据集和评价程序进行了结构性审查。这项调查划定了DTG与其他自然语言生成(NLG)景观的界限,包括最新的文献综合,并突出了大NLG伞内外的技术采用阶段。我们从这一整体观点出发,强调DTG研究有希望的渠道,不仅侧重于设计语言能力系统,而且侧重于体现公平和问责的系统。