We present a new supervised image classification method applicable to a broad class of image deformation models. The method makes use of the previously described Radon Cumulative Distribution Transform (R-CDT) for image data, whose mathematical properties are exploited to express the image data in a form that is more suitable for machine learning. While certain operations such as translation, scaling, and higher-order transformations are challenging to model in native image space, we show the R-CDT can capture some of these variations and thus render the associated image classification problems easier to solve. The method -- utilizing a nearest-subspace algorithm in R-CDT space -- is simple to implement, non-iterative, has no hyper-parameters to tune, is computationally efficient, label efficient, and provides competitive accuracies to state-of-the-art neural networks for many types of classification problems. In addition to the test accuracy performances, we show improvements (with respect to neural network-based methods) in terms of computational efficiency (it can be implemented without the use of GPUs), number of training samples needed for training, as well as out-of-distribution generalization. The Python code for reproducing our results is available at https://github.com/rohdelab/rcdt_ns_classifier.
翻译:我们提出了一个适用于广类图像变形模型的新监督图像分类方法。该方法利用先前描述的雷达累积分布变换(R-CDT)用于图像数据,其数学特性被用来以更适合机器学习的形式表达图像数据;虽然翻译、缩放和更高级变换等某些操作对于在本地图像空间建模具有挑战性,但我们显示R-CDT能够捕捉其中一些变异,从而使得相关的图像分类问题更容易解决。该方法 -- -- 使用R-CDT空间最近的子空间算法 -- -- 易于实施、非触地化、没有超参数可调、具有计算效率、标签效率,并为许多类型的分类问题提供最先进的神经网络设计。除了测试精确性表现外,我们还显示计算效率方面的改进(在神经网络方法方面)(可以在不使用GPUPS的情况下实施)、培训所需的培训样品数量以及超分级化一般化结果。