Supervised dictionary learning (SDL) is a classical machine learning method that simultaneously seeks feature extraction and classification tasks, which are not necessarily a priori aligned objectives. The goal of SDL is to learn a class-discriminative dictionary, which is a set of latent feature vectors that can well-explain both the features as well as labels of observed data. In this paper, we provide a systematic study of SDL, including the theory, algorithm, and applications of SDL. First, we provide a novel framework that `lifts' SDL as a convex problem in a combined factor space and propose a low-rank projected gradient descent algorithm that converges exponentially to the global minimizer of the objective. We also formulate generative models of SDL and provide global estimation guarantees of the true parameters depending on the hyperparameter regime. Second, viewed as a nonconvex constrained optimization problem, we provided an efficient block coordinate descent algorithm for SDL that is guaranteed to find an $\varepsilon$-stationary point of the objective in $O(\varepsilon^{-1}(\log \varepsilon^{-1})^{2})$ iterations. For the corresponding generative model, we establish a novel non-asymptotic local consistency result for constrained and regularized maximum likelihood estimation problems, which may be of independent interest. Third, we apply SDL for imbalanced document classification by supervised topic modeling and also for pneumonia detection from chest X-ray images. We also provide simulation studies to demonstrate that SDL becomes more effective when there is a discrepancy between the best reconstructive and the best discriminative dictionaries.
翻译:监督字典学习( SDL) 是一种经典的机器学习方法, 同时寻求提取和分类任务, 这不一定是一个先验一致的目标。 SDL 的目标是学习一个等级偏差词典, 这是一种潜在的特征矢量, 可以很好地解释所观察到的数据的特征和标签。 在本文中, 我们提供SDL 的系统研究, 包括SDL 的理论、 算法和应用。 首先, 我们提供了一个新颖的框架, “ 提升' SDL, 作为综合因子空间的共解问题, 并提出一个低级的预测梯度下行算法, 以指数趋同目标的全球最小化。 我们还开发了SDLL的基因化模型, 并且根据超参数制度提供真实参数的全球估算保证。 其次, 我们被视为一个非convex 限制优化的问题, 我们为SDL提供了一个高效的底线运算法, 保证在 $( varepslon)-1 和 Stabilationalationalation 之间找到一个美元 值, roal dealislationalizalislational restial roislation roislation rouplisl) 。 我们还可以算算算算出一个最佳的模型, 和最佳的模型, 。