Measuring the confidence of AI models is critical for safely deploying AI in real-world industrial systems. One important application of confidence measurement is information extraction from scanned documents. However, there exists no solution to provide reliable confidence score for current state-of-the-art deep-learning-based information extractors. In this paper, we propose a complete and novel architecture to measure confidence of current deep learning models in document information extraction task. Our architecture consists of a Multi-modal Conformal Predictor and a Variational Cluster-oriented Anomaly Detector, trained to faithfully estimate its confidence on its outputs without the need of host models modification. We evaluate our architecture on real-wold datasets, not only outperforming competing confidence estimators by a huge margin but also demonstrating generalization ability to out-of-distribution data.
翻译:测量AI模型的信任度对于在现实世界工业系统中安全部署AI至关重要。从扫描文件中提取信息是信任度测量的一个重要应用。然而,对于目前最先进的深层学习信息提取器来说,没有办法提供可靠的信任分数。在本文中,我们提出了一个完整和新颖的架构,以衡量当前在文件信息提取工作中深层学习模型的信心。我们的架构包括一个多模式非正式预测器和一个多层次的集群式反射探测器,该探测器受过训练,可以忠实地估计其对产出的信心,而不需要主机模型的修改。我们评估了我们关于真实狼人数据集的架构,不仅以巨大的利润比对竞争性估计器表现得更好,而且还展示了超越分布数据的普遍能力。