Computational models providing accurate estimates of their uncertainty are crucial for risk management associated with decision making in healthcare contexts. This is especially true since many state-of-the-art systems are trained using the data which has been labelled automatically (self-supervised mode) and tend to overfit. In this work, we investigate the quality of uncertainty estimates from a range of current state-of-the-art predictive models applied to the problem of observation detection in radiology reports. This problem remains understudied for Natural Language Processing in the healthcare domain. We demonstrate that Gaussian Processes (GPs) provide superior performance in quantifying the risks of 3 uncertainty labels based on the negative log predictive probability (NLPP) evaluation metric and mean maximum predicted confidence levels (MMPCL), whilst retaining strong predictive performance.
翻译:准确估计其不确定性的计算模型对于与医疗保健决策有关的风险管理至关重要,因为许多最先进的系统都使用自动标注的数据(自我监督模式)得到培训,而且往往过于完善,因此情况尤其如此。在这项工作中,我们调查了在放射学报告中用于观察检测问题的一系列最新预测模型的不确定性估算质量。这个问题在卫生保健领域的自然语言处理方面仍然没有得到充分研究。我们证明,高西亚进程在根据负逻辑预测概率(NLPP)评估指标和平均预期最大信任度(MMPCL)量化3个不确定性标签的风险方面表现优异,同时保持了强大的预测性能。