Linear Discriminant Analysis (LDA) is a widely-used supervised dimensionality reduction method in computer vision and pattern recognition. In null space based LDA (NLDA), a well-known LDA extension, between-class distance is maximized in the null space of the within-class scatter matrix. However, there are some limitations in NLDA. Firstly, for many data sets, null space of within-class scatter matrix does not exist, thus NLDA is not applicable to those datasets. Secondly, NLDA uses arithmetic mean of between-class distances and gives equal consideration to all between-class distances, which makes larger between-class distances can dominate the result and thus limits the performance of NLDA. In this paper, we propose a harmonic mean based Linear Discriminant Analysis, Multi-Class Discriminant Analysis (MCDA), for image classification, which minimizes the reciprocal of weighted harmonic mean of pairwise between-class distance. More importantly, MCDA gives higher priority to maximize small between-class distances. MCDA can be extended to multi-label dimension reduction. Results on 7 single-label data sets and 4 multi-label data sets show that MCDA has consistently better performance than 10 other single-label approaches and 4 other multi-label approaches in terms of classification accuracy, macro and micro average F1 score.
翻译:在以空间为基础的LDA(LDA)中,众所周知的LDA(LDA)是LDA(LDA)的延伸。在空基LDA(NLDA)中,阶级之间的距离在阶级内部散射矩阵的空隙中最大化。然而,LNA中有一些限制。首先,对于许多数据集而言,阶级内部散射矩阵的空格并不存在,因此,全国民主联盟不适用于这些数据集。第二,全国民主联盟使用阶级间距离的算术平均值,对各阶级间距离给予同等的考虑,使更大型的阶级间距离能够主宰结果,从而限制全国民主联盟的绩效。在本文中,我们建议采用基于和谐平均线性差异分析、多种族差异分析(MCDA)来协调性平均等距离,以进行图像分类,从而尽可能减少各阶级间距离之间加权的加权调和差的对等。更重要的是,MCD(MA)可扩展至多标签范围,MA可扩展至多标签层面。在7个单级标准、多级标准、多级标准、多级标准、多级标准、多级标准、多级标准、多级标准、多级标准、多级数据上的结果比标准、多级标准。