项目名称: 基于图的半监督学习关键问题研究及其在图像理解中的应用
项目编号: No.61202231
项目类型: 青年科学基金项目
立项/批准年度: 2013
项目学科: 计算机科学学科
项目作者: 卢志武
作者单位: 中国人民大学
项目金额: 24万元
中文摘要: 在图像理解的实际应用中,人工标注数据的代价总是很大,而收集大量未标注数据则较为容易。因此,作为利用未标注数据进行学习的经典方法,基于图的半监督学习对图像理解中的问题解决有着重要的意义,即它能够有效地利用大量未标注数据改善图像理解的效果,从而减少对人工标注的依赖。虽然相关的研究工作已经取得不错的结果,但是仍然存在如下关键问题急需解决:如何根据具体的应用构建图;如何处理数据标注中的噪声;如何有效利用其它形式的监督信息(如成对约束)。本项目拟研究建立基于L1范数的拉普拉斯正则化方法来解决图构建和去噪这两个关键问题,同时研究将约束传递分解为一系列半监督学习子问题以便更好地利用约束信息,并研究将这些新方法应用于图像分类与标注、图像语义分析与表示等较难的图像理解问题。本项目有望在基础理论和关键技术方面取得较大的进展,并推动机器学习、模式识别、图像处理、计算机视觉等相关领域的发展。
中文关键词: 半监督学习;拉普拉斯正则化;约束传递;图像理解;
英文摘要: In different applications of image understanding, the manual labeling of training data is commonly tedious and time-consuming, while the access to a large number of unlabeled data is much easier. Hence, as a typical technique for learning with labeled and unlabeled data, graph-based semi-supervised learning plays an important role in image understanding, since the large number of unlabeled data can thus be utilized to improve the results of image understanding. Although many promising results have been reported in previous work, there remain three key problems associated with graph-based semi-supervised learning: how to construct an application-dependent graph; how to handle noisy labels; how to effectively exploit other types of supervisory information (e.g. pairwise constraints). In this project, we make attempt to solve the first two key problems (i.e. graph construction and noise removal) by proposing a new L1-norm Laplacian regularization method, then decompose the challenging problem of pairwise constraint propagation into a series of semi-supervised learning subproblems to effectively exploit this supervisory information, and finally extend the proposed methods to different challenging tasks of image understanding (e.g. image classification and annotation, image semantic analysis and representation). The
英文关键词: semi-supervised learning;Laplacian regularization;pairwise constraint propagation;image understanding;