基于网上弱标注数据的个性化图像标注研究

项目名称： 基于网上弱标注数据的个性化图像标注研究

项目编号： No.61303184

项目类型： 青年科学基金项目

立项/批准年度： 2014

项目学科： 自动化技术、计算机技术

项目作者： 李锡荣

作者单位： 中国人民大学

项目金额： 28万元

中文摘要： 自动图像标注对于用户管理和检索不断增加的图像数据至关重要。现有工作专注于构建通用型图像标注模型，即以一个模型应对所有用户。这种一对多的图像标注模式忽略了不同用户在特定情境下对于特定图像主题的偏好，使得标注准确性受到了根本性的制约。为了满足不同用户对于图像标注的个性化需求，本项目研究一个模型对应一个用户的图像标注新模式。为此，我们提出基于网上弱标注数据的个性化图像标注方法。为了突破训练数据获取的瓶颈，我们研究如何从普通用户在互联网上产生的大量弱标注数据中为特定语义标签选取相关正样本和负样本，以建立大规模通用型图像标注模型。进一步，我们研究从用户历史数据中动态挖掘其个人偏好，并结合图像产生时所处的包括地理、天气等上下文环境信息，对通用型图像标注模型所预测的标签进行个性化的优化排序，从而为每个用户提供可量身定制的自动图像标注模型。本项目的研究成果将为个性化的多媒体信息检索提供技术支撑。

中文关键词： 图像标注；社会化媒体；图像标签相关性学习；地理感知图像标注；

英文摘要： Automatic image annotation is crucial for managing and retrieving the increasing amounts of image data. Existing work focuses on building a generic image annotation model, hoping that the model would be universally applicable to annotate images from distinct users. However, such an one-for-all tagging mode ignores a user's personal preference for image subjects in a given situation, resulting in limited annotation accuracy. In order to annotate images in a personalized manner, this proposal studies a new one-model-for-one-user tagging mode. We propose personalized image annotation based on many weakly labeled examples available on the Internet. In order to build a large-scale generic image annotation model, we study how to select relevant training examples for a given semantic tag from the weakly labeled data. For a specific user, we mine her personal preference over image subjects from her past data. Personal preference, in combination with multiple contextual information including the place where images were made and local weather, are then exploited to optimize the prediction of the generic model in a personalized manner. By doing so, we deliver image annotation models personalized with respect to individual users.The resultant techniques of this research will be beneficial to personalized multimedia informat

英文关键词： image annotation；social media；image tag relevance learning；geo-aware image cannotation；

成为VIP会员查看完整内容