Automatic classification of diabetic retinopathy from retinal images has been widely studied using deep neural networks with impressive results. However, there is a clinical need for estimation of the uncertainty in the classifications, a shortcoming of modern neural networks. Recently, approximate Bayesian deep learning methods have been proposed for the task but the studies have only considered the binary referable/non-referable diabetic retinopathy classification applied to benchmark datasets. We present novel results by systematically investigating a clinical dataset and a clinically relevant 5-class classification scheme, in addition to benchmark datasets and the binary classification scheme. Moreover, we derive a connection between uncertainty measures and classifier risk, from which we develop a new uncertainty measure. We observe that the previously proposed entropy-based uncertainty measure generalizes to the clinical dataset on the binary classification scheme but not on the 5-class scheme, whereas our new uncertainty measure generalizes to the latter case.
翻译:对视网膜图像中的糖尿病视网膜病进行自动分类,利用深层神经网络进行了广泛研究,并取得了令人印象深刻的成果。然而,临床上需要估计分类中的不确定性,这是现代神经网络的一个缺陷。最近,为这项任务提出了近似贝叶斯深层学习方法,但研究只考虑了基准数据集所适用的二元可参考/不可引用糖尿病视网膜病分类。我们通过系统调查临床数据集和临床上相关的5级分类办法,以及基准数据集和二元分类办法,提出了新的结果。此外,我们从不确定性措施和分类风险之间得出了联系,我们从中制定了新的不确定性计量标准。我们注意到,先前提议的基于酶的不确定性计量标准概括了二元分类办法的临床数据集,但没有概括了5级办法,而我们新的不确定性计量标准则概括了后一种情况。