Objective: To investigate whether performance (number of correct decisions) of humans supported by a computer alerting tool can be improved by tailoring the tool's alerting threshold (sensitivity/specificity combination) according to user ability and task difficulty. Background: Many researchers implicitly assume that for each tool there exists a single ideal threshold. But research shows the effects of alerting tools on decision errors to vary depending on variables such as user ability and task difficulty. These findings motivated our investigation. Method: Forty-seven participants edited text passages, aided by a simulated spell-checker tool. We experimentally manipulated passage difficulty and tool alerting threshold, measured participants' editing and dictation ability, and counted participants' decision errors (false positives + false negatives). We investigated whether alerting threshold, user ability, task difficulty and their interactions affected error count. Results: Which alerting threshold better helped a user depended on an interaction between user ability and passage difficulty. Some effects were large: for higher ability users, a more sensitive threshold reduced errors by 30%, on the easier passages. Participants were not significantly more likely to prefer the alerting threshold with which they performed better. Conclusion: Adjusting alerting thresholds for individual users' ability and task difficulty could substantially improve effects of automated alerts on user decisions. This potential deserves further investigation. Improvement size and rules for adjustment will be application-specific. Application: Computer alerting tools have critical roles in many domains. Designers should assess potential benefits of adjustable alerting thresholds for their specific CAT application. Guidance for choosing thresholds will be essential for realizing these benefits in practice.
翻译:目标: 调查人类的性能( 正确的决定数目) 是否可以通过根据用户的能力和任务难度来根据用户的能力和任务难度来调整工具的警示阈值( 敏感度/ 具体度组合) 来提高工具的性能( 正确的决定数) 。 背景: 许多研究人员暗含地假设每个工具都有一个单一的理想阈值。 但是,研究显示了根据用户能力和任务难度等变量对决定错误的警示工具的影响。 这些发现促使我们进行调查。 方法: 47名参与者编辑了文本段落,并借助了模拟拼写检查工具。 我们实验性操纵了通道难度和工具警示阈值,测量了参与者的编辑和判断选择能力,并计算了参与者的决定错误( 假正数+虚假负数 )。 我们调查了警示阈值、 用户能力、 任务难度及其互动影响计数。 结果: 哪个警示阈值更好地帮助用户取决于用户能力和通过难度之间的相互作用。 某些影响很大: 对于能力更高的用户来说, 更敏感的阈值减少30 %, 对于较容易的通道。 参与者们更可能更倾向于选择用于设计其准确的警报性操作的准确的警告值 。 将提高用户的升级的临界值 。