Details of the designs and mechanisms in support of human-AI collaboration must be considered in the real-world fielding of AI technologies. A critical aspect of interaction design for AI-assisted human decision making are policies about the display and sequencing of AI inferences within larger decision-making workflows. We have a poor understanding of the influences of making AI inferences available before versus after human review of a diagnostic task at hand. We explore the effects of providing AI assistance at the start of a diagnostic session in radiology versus after the radiologist has made a provisional decision. We conducted a user study where 19 veterinary radiologists identified radiographic findings present in patients' X-ray images, with the aid of an AI tool. We employed two workflow configurations to analyze (i) anchoring effects, (ii) human-AI team diagnostic performance and agreement, (iii) time spent and confidence in decision making, and (iv) perceived usefulness of the AI. We found that participants who are asked to register provisional responses in advance of reviewing AI inferences are less likely to agree with the AI regardless of whether the advice is accurate and, in instances of disagreement with the AI, are less likely to seek the second opinion of a colleague. These participants also reported the AI advice to be less useful. Surprisingly, requiring provisional decisions on cases in advance of the display of AI inferences did not lengthen the time participants spent on the task. The study provides generalizable and actionable insights for the deployment of clinical AI tools in human-in-the-loop systems and introduces a methodology for studying alternative designs for human-AI collaboration. We make our experimental platform available as open source to facilitate future research on the influence of alternate designs on human-AI workflows.
翻译:支持人类-AI合作的设计和机制细节必须在AI技术的实际领域考虑。AI技术的实际领域是AI协助人类决策的相互作用设计的关键方面,其关键方面是AI协助的人类决策决策流程中显示AI推断和排序的政策。我们对在人类对当前诊断任务进行审查之前和之后提供AI推断结果的影响认识不足。我们探讨了在放射学诊断会议开始时和放射师作出临时决定之后提供AI协助的影响。我们进行了用户研究,19名兽医放射学家在AI协助的人类决策中发现了病人X射线图像中的放射学发现。我们使用了两种工作流程配置来分析(一) 定点效果,(二) 人类-AI小组诊断性表现和协议,(三) 决策花费的时间和信心,(四) 认为AI的效用。我们发现,要求参与者在审查可选择的AI工具之前登记临时反应,我们不太可能同意AI的意见,而不论建议是否准确,在与AI平台发生分歧的情况下,也不太可能在与AI研究中要求进行初步分析。在AI的用户中,为研究提供有用的案例提供有用的。