Wizard of Oz (WoZ) as a prototyping method has been used to simulate intelligent user interfaces, particularly for speech-based systems. However, as our societies' expectations on artificial intelligence (AI) grows, the question remains whether a single Wizard is sufficient for it to simulate smarter systems and more complex interactions. Optimistic visions of 'what artificial intelligence (AI) can do' places demands on WoZ platforms to simulate smarter systems and more complex interactions. This raises the question of whether the typical approach of employing a single Wizard is sufficient. Moreover, while existing work has employed multiple Wizards in WoZ studies, a multi-Wizard approach has not been systematically studied in terms of feasibility, effectiveness, and challenges. We offer Wizundry, a real-time, web-based WoZ platform that allows multiple Wizards to collaboratively operate a speech-to-text based system remotely. We outline the design and technical specifications of our open-source platform, which we iterated over two design phases. We report on two studies in which participant-Wizards were tasked with negotiating how to cooperatively simulate an interface that can handle natural speech for dictation and text editing as well as other intelligent text processing tasks. We offer qualitative findings on the Multi-Wizard experience for Dyads and Triads of Wizards. Our findings reveal the promises and challenges of the multi-Wizard approach and open up new research questions.
翻译:巫师奥兹(WoZ)作为一种原型制作方法,已被用于模拟智能用户界面,特别是语音系统。然而,随着人们对人工智能(AI)的期望不断增长,一个巫师是否足以模拟更智能的系统和更复杂的交互,这个问题仍然存在。对“人工智能可以做到什么”的乐观前景的看法,使得WoZ平台需要模拟更智能的系统和更复杂的交互。这就引起了一个问题,是否采用单巫师的典型方法是足够的。此外,虽然现有的工作在WoZ研究中使用了多个巫师,但多巫师方法尚未在可行性、有效性和挑战方面进行系统研究。我们提供Wizundry,一个实时的、基于Web的WoZ平台,允许多个巫师在远程协作操作基于语音识别的系统。我们概述了我们的开源平台的设计和技术规范,这是在两个设计阶段上迭代的。我们报告了两项研究,其中参与者巫师被要求协商如何协作地模拟一个可以处理自然语音的界面,包括口述和文本处理等其他智能文本处理任务。我们提供巫师二元组和三元组的多巫师体验的定性结果。我们的发现揭示了多巫师方法的优点和挑战,并开启了新的研究问题。