Conversation agents, commonly referred to as chatbots, are increasingly deployed in many domains to allow people to have a natural interaction while trying to solve a specific problem. Given their widespread use, it is important to provide their users with methods and tools to increase users awareness of various properties of the chatbots, including non-functional properties that users may consider important in order to trust a specific chatbot. For example, users may want to use chatbots that are not biased, that do not use abusive language, that do not leak information to other users, and that respond in a style which is appropriate for the user's cognitive level. In this paper, we address the setting where a chatbot cannot be modified, its training data cannot be accessed, and yet a neutral party wants to assess and communicate its trustworthiness to a user, tailored to the user's priorities over the various trust issues. Such a rating can help users choose among alternative chatbots, developers test their systems, business leaders price their offering, and regulators set policies. We envision a personalized rating methodology for chatbots that relies on separate rating modules for each issue, and users' detected priority orderings among the relevant trust issues, to generate an aggregate personalized rating for the trustworthiness of a chatbot. The method is independent of the specific trust issues and is parametric to the aggregation procedure, thereby allowing for seamless generalization. We illustrate its general use, integrate it with a live chatbot, and evaluate it on four dialog datasets and representative user profiles, validated with user surveys.
翻译:通常被称为聊天机的交流代理机构正在许多领域越来越多地被部署,以便人们在试图解决具体问题时能够自然地互动。鉴于其广泛使用,重要的是向其用户提供方法和工具,以提高用户对聊天机各种特性的认识,包括用户为信任某个特定的聊天机而可能认为重要的非功能性属性。例如,用户可能希望使用不偏颇、不使用不当语言、不向其他用户透露信息、不向其他用户透露信息、以适合用户认知水平的风格回应的聊天机。在本文件中,我们处理不能修改聊天机的用户设置,不能访问其培训数据,而中立方则希望根据用户对各种信任问题的优先事项来评估和向用户传达其信任度。这种评级可以帮助用户选择其他聊天机、开发商测试其系统、商业领导人提供的价格以及监管者制定政策。我们设想了对聊天机进行个人化评级的方法,该方法取决于每个问题的四个评级模块,不能修改其培训数据,但中立方则希望评估用户的信任度,从而将具体的信任度问题排序。