Prompt-based or in-context learning has achieved high zero-shot performance on many natural language generation (NLG) tasks. Here we explore the performance of prompt-based learning for simultaneously controlling the personality and the semantic accuracy of an NLG for task-oriented dialogue. We experiment with prompt-based learning on the PERSONAGE restaurant recommendation corpus to generate semantically and stylistically-controlled text for 5 different Big-5 personality types: agreeable, disagreeable, conscientious, unconscientious, and extravert. We test two different classes of discrete prompts to generate utterances for a particular personality style: (1) prompts that demonstrate generating directly from a meaning representation that includes a personality specification; and (2) prompts that rely on first converting the meaning representation to a textual pseudo-reference, and then using the pseudo-reference in a textual style transfer (TST) prompt. In each case, we show that we can vastly improve performance by over-generating outputs and ranking them, testing several ranking functions based on automatic metrics for semantic accuracy, personality-match, and fluency. We also test whether NLG personality demonstrations from the restaurant domain can be used with meaning representations for the video game domain to generate personality stylized utterances about video games. Our findings show that the TST prompts produces the highest semantic accuracy (78.46% for restaurants and 87.6% for video games) and personality accuracy (100% for restaurants and 97% for video games). Our results on transferring personality style to video game utterances are surprisingly good. To our knowledge, there is no previous work testing the application of prompt-based learning to simultaneously controlling both style and semantic accuracy in NLG.
翻译:快速即时或文中学习在许多自然语言生成(NLG)任务中取得了高零分的成绩。在这里,我们探索了同步学习的性能,以同时控制NLG的任务导向对话的个性和语义准确性。我们实验了在PeperSONAG餐厅建议堆上快速学习,以生成5种不同的大五个个个个性类型:可接受、可接受、可接受、自觉、不自觉和外向。我们测试了两种不同的独立游戏风格。我们测试了两种不同类别,以生成某种性格风格的发音:(1) 快速学习,以同时控制NLG游戏的个性和语义准确性;以及(2) 快速进行测试,以同时控制NLG游戏的个性和语义,然后在文本风格传输(TST)时使用假的参考。在每种情况下,我们显示我们可以通过过度生成产出和排序来大大改进性能,测试基于自动测量的精度、个性化和流利度的几级函数。我们还可以测试NLG游戏的直观性、直径直径、Stenvial性、Syal、Syal、Syal、Syal、Syal、Syal、Syal、Syal、Syal、T、Syals、Syal、Syal、Speal、Syal、Specudu、Speal、Speal、Speal、Speal、Speal、Speal、Syal、Syal、Syal、Syal、Spe、Sy、Spe、Sped、Spedal、Spe、Spe、Spe、Sped、Sped、SD、Sped、Spe、Spedal、Speal、SD、Sped、SD、Spe、SD、Speds、Speds、S、Sped、Spedia、Speal、Spedia、Sped、Sped、Sped、S、S、S、S、S、Sped、S、SDres、SD、SD、Spedia、S、S、SD、Speal、SD、Sped、S