An algorithm based on a deep probabilistic architecture referred to as a tree-structured sum-product network (t-SPN) is considered for cell classification. The t-SPN is constructed such that the unnormalized probability is represented as conditional probabilities of a subset of most similar cell classes. The constructed t-SPN architecture is learned by maximizing the margin, which is the difference in the conditional probability between the true and the most competitive false label. To enhance the generalization ability of the architecture, L2-regularization (REG) is considered along with the maximum margin (MM) criterion in the learning process. To highlight cell features, this paper investigates the effectiveness of two generic high-pass filters: ideal high-pass filtering and the Laplacian of Gaussian (LOG) filtering. On both HEp-2 and Feulgen benchmark datasets, the t-SPN architecture learned based on the max-margin criterion with regularization produced the highest accuracy rate compared to other state-of-the-art algorithms that include convolutional neural network (CNN) based algorithms. The ideal high-pass filter was more effective on the HEp-2 dataset, which is based on immunofluorescence staining, while the LOG was more effective on the Feulgen dataset, which is based on Feulgen staining.
翻译:本文考虑了一种基于深度概率架构——树状和积网络(t-SPN)的算法进行细胞分类。t-SPN被构建为未归一化概率,以最相似的细胞类别的条件概率表示。构建的t-SPN架构通过最大化间隔(MM)学习,由此来增加分类边缘,即真实标签与最具有竞争力的错误标签之间的条件概率差异。为了提高该架构的泛化能力,本文在学习过程中考虑了L2正则化(REG)以及最大间隔(MM)标准。为了突出显示细胞特征,本文研究了两种通用高通滤波器的有效性:理想高通滤波和高斯拉普拉斯(LOG)滤波。在HEp-2和Feulgen基准数据集上,基于最大间隔标准和正则化学习的t-SPN架构相对于其他最先进的算法,包括基于卷积神经网络(CNN)的算法,产生了最高的准确率。理想高通滤波器更适用于基于免疫荧光染色的HEp-2数据集,而LOG滤波器更适用于基于Feulgen染色的Feulgen数据集。