In the field of pattern classification, deep learning classifiers are mostly trained end-to-end, with the loss function constraining only the final output (posterior probability) of the network, so the presence of Softmax is essential. In end-to-end learning, there is usually no effective loss function that constrains learning entirely through the features of the intermediate layers, so the distribution of sample latent features is suboptimal and there remains room to improve classification accuracy. Based on the concept of Predefined Evenly-Distributed Class Centroids (PEDCC), this article proposes a Softmax-free loss function based on a predefined optimal distribution of latent features, POD Loss. The loss function constrains only the latent features of the samples, including the norm-adaptive cosine distance between a sample's latent feature vector and its predefined evenly-distributed class centroid, and the correlation between the latent features of the samples. Finally, cosine distance is used for classification. Compared with the commonly used Softmax Loss, several typical Softmax-related loss functions and PEDCC-Loss, experiments with several typical deep learning classification networks on several commonly used datasets show that POD Loss consistently achieves better classification performance and converges more easily. Code is available at https://github.com/TianYuZu/POD-Loss.
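To make the two constraints concrete, below is a minimal sketch of a POD-style loss in PyTorch. This is not the authors' released implementation: the function names `pod_loss` and `classify`, the weighting parameter `lambda_corr`, and the exact forms of the two terms (a cosine-distance term between normalized features and fixed PEDCC centroids, plus a decorrelation penalty on the off-diagonal of the batch feature correlation matrix) are illustrative assumptions; the paper's norm-adaptive cosine distance may differ in detail.

```python
import torch
import torch.nn.functional as F

def pod_loss(features, labels, centroids, lambda_corr=1.0):
    """Hypothetical POD-style loss on latent features only (no Softmax).

    features:  (B, D) latent feature vectors from the network
    labels:    (B,)   ground-truth class indices
    centroids: (C, D) predefined evenly-distributed class centroids (PEDCC),
               assumed fixed (not learned) during training
    """
    # Term 1: cosine distance between each feature and its class centroid.
    f = F.normalize(features, dim=1)
    c = F.normalize(centroids, dim=1)
    cos_sim = (f * c[labels]).sum(dim=1)              # (B,)
    distance_term = (1.0 - cos_sim).mean()

    # Term 2: decorrelation of latent feature dimensions, penalizing
    # off-diagonal entries of the batch correlation matrix.
    z = features - features.mean(dim=0, keepdim=True)
    cov = (z.t() @ z) / (features.size(0) - 1)        # (D, D)
    diag = torch.diagonal(cov)
    corr = cov / torch.sqrt(torch.outer(diag, diag) + 1e-8)
    off_diag = corr - torch.diag(torch.diagonal(corr))
    corr_term = off_diag.pow(2).mean()

    return distance_term + lambda_corr * corr_term

def classify(features, centroids):
    # Prediction: pick the centroid with the largest cosine similarity.
    sims = F.normalize(features, dim=1) @ F.normalize(centroids, dim=1).t()
    return sims.argmax(dim=1)
```

Because the class centroids are predefined and evenly distributed rather than learned, the final classification layer disappears and prediction reduces to a nearest-centroid rule under cosine similarity, which is why no Softmax is needed at either training or test time.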