基于实例迁移的文本情感分析领域适应问题研究 - 专知基金

会员服务 ·

0

情感分析 · 领域适应 · 迁移学习 ·

2013 年 12 月 31 日

基于实例迁移的文本情感分析领域适应问题研究

国家自然科学基金

国家自然科学基金委员会

项目名称： 基于实例迁移的文本情感分析领域适应问题研究

项目编号： No.61305090

项目类型： 青年科学基金项目

立项/批准年度： 2014

项目学科： 自动化技术、计算机技术

项目作者： 夏睿

作者单位： 南京理工大学

项目金额： 23万元

中文摘要： 情感分析领域适应问题是近几年来自然语言处理领域的前沿问题和研究热点。在领域适应问题的研究中，存在"重标注迁移、轻实例迁移"的现象：标注迁移已经得到了广泛关注和深入研究；而实例迁移受制于概率比估计的难题，成为相对薄弱的环节和空白的地带。本项目针对上述状况，主要开展下列三方面的工作：1、针对实例迁移展开深入研究。将概率比估计问题转化为样本与分布的相似度计算问题，提出基于PU学习的源领域样本与目标领域相似度计算方法；2、基于该相似度，研究跨领域统计建模中的样本选择和权重采样方法，建立完善的基于实例迁移的领域适应模型；3、最后将问题扩展到多个源领域，探讨基于多源领域协同的情感分析领域适应方法。本项目预计在国内外重要学术期刊和顶级国际会议上发表论文不少于6篇。本项目的完成将有助于推动"大数据"背景下面向互联网海量和多源文本的情感分析方法的研究，具有重要的理论意义和应用价值。

中文关键词： 情感分析；领域适应；迁移学习；；

英文摘要： Domain adaptation in sentiment classficaition have been a frontier research direction in recent years in the field of natural language processing. "Instance adaptation" and "labeling adaptation" are two basic factors in the domain adaptation problem. "Labeling adaptation" has received wide attention and in-depth study. However, since density ratio is hard to estimate, "instance adaptation" has not been well studied in the literature. In this project, we are going to conduct research in three aspects: Firstly, we transform density ratio estimation into the problem of similarity measure between a sample and a distribution, and proposed a PU learning based approach to address this problem. Secondly, based on the similarities obtained by PU learning, we study the issues of sample selction and importance sampling for instance adaptation; Finally, we extend the task from one source to multiple source domains, and study correspoonding domain adaptation methods. The implementation of this project will help promote the research of cross-domain sentiment classification, which is a fundamental problem of text mining on the "big-data" Internet.

英文关键词： Sentiment Analysis；Domain Adaptation；Transfer Learning；；

成为VIP会员查看完整内容

0

相关内容

情感分析

狭义的情感分析（sentiment analysis）是指利用计算机实现对文本数据的观点、情感、态度、情绪等的分析挖掘。广义的情感分析则包括对图像视频、语音、文本等多模态信息的情感计算。简单地讲，情感分析研究的目标是建立一个有效的分析方法、模型和系统，对输入信息中某个对象分析其持有的情感信息，例如观点倾向、态度、主观观点或喜怒哀乐等情绪表达。

知识荟萃

精品入门和进阶教程、论文和代码整理等

更多

查看相关VIP内容、论文、资讯等

「图分类研究」最新2022综述

「图分类研究」最新2022综述

专知会员服务

99+阅读 · 2022年2月13日

【NeurIPS2021】由相似性迁移的弱样本细粒度分类

【NeurIPS2021】由相似性迁移的弱样本细粒度分类

专知会员服务

21+阅读 · 2021年11月11日

IEEE TPAMI | 基于标注偏差估计的实例相关PU学习

IEEE TPAMI | 基于标注偏差估计的实例相关PU学习

专知会员服务

12+阅读 · 2021年10月23日

文本情感分析方法研究综述

专知会员服务

56+阅读 · 2021年4月20日

【斯坦福CS330】终身学习: 问题陈述，前后迁移，30页ppt

【斯坦福CS330】终身学习: 问题陈述，前后迁移，30页ppt

专知会员服务

26+阅读 · 2020年12月13日

基于双注意力机制和迁移学习的跨领域推荐模型

专知会员服务

48+阅读 · 2020年10月20日

基于迁移学习的细粒度实体分类方法的研究

专知会员服务

32+阅读 · 2020年9月2日

【ACL2020】基于图神经网络的文本分类新方法

【ACL2020】基于图神经网络的文本分类新方法

专知会员服务

69+阅读 · 2020年7月12日

融合零样本学习和小样本学习的弱监督机器学习方法综述

专知会员服务

113+阅读 · 2020年3月20日

重庆大学发布最新《迁移自适应学习十年进展》综述论文，26页pdf333篇文献

重庆大学发布最新《迁移自适应学习十年进展》综述论文，26页pdf333篇文献

专知会员服务

43+阅读 · 2019年11月26日

迁移学习前沿探究探讨：低资源、领域泛化与安全迁移

迁移学习前沿探究探讨：低资源、领域泛化与安全迁移

大数据文摘

2+阅读 · 2022年4月13日

【NeurIPS2021】由相似性迁移的弱样本细粒度分类

【NeurIPS2021】由相似性迁移的弱样本细粒度分类

专知

0+阅读 · 2021年11月11日

IJCAI 2021 | 不确定性感知小样本图像分类模型，实现SOTA性能

IJCAI 2021 | 不确定性感知小样本图像分类模型，实现SOTA性能

微软研究院AI头条

0+阅读 · 2021年8月19日

文本情感分析方法研究综述

文本情感分析方法研究综述

专知

4+阅读 · 2021年4月20日

【综述】迁移自适应学习十年进展

【综述】迁移自适应学习十年进展

专知

41+阅读 · 2019年11月26日

赛尔笔记 | 自然语言处理中的迁移学习(上)

赛尔笔记 | 自然语言处理中的迁移学习(上)

哈工大SCIR

16+阅读 · 2019年10月17日

迁移自适应学习最新综述，附21页论文下载

迁移自适应学习最新综述，附21页论文下载

专知

34+阅读 · 2019年3月13日

NLP预训练模型大集合！

NLP预训练模型大集合！

全球人工智能

31+阅读 · 2018年12月29日

论文浅尝 | 基于局内去噪和迁移学习的关系抽取

论文浅尝 | 基于局内去噪和迁移学习的关系抽取

开放知识图谱

16+阅读 · 2018年12月2日

文本情感分析的预处理

文本情感分析的预处理

Datartisan数据工匠

17+阅读 · 2018年3月8日

HDFS读、写性能概率建模与模型迁移方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向机器翻译的多词表达语义分析及应用研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于迁移学习的脑机接口特征提取和预测方法研究

国家自然科学基金

2+阅读 · 2013年12月31日

弱标注下基于主动学习的检测器适应问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于生成模型的迁移学习算法研究及其应用

国家自然科学基金

0+阅读 · 2012年12月31日

基于非平行超平面支持向量机的最优化模型及算法研究

国家自然科学基金

1+阅读 · 2012年12月31日

领域适应性问题相关学习算法与理论研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于跨域迁移学习的网络视频检索技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

面向文本分类的迁移学习和半监督学习方法研究

国家自然科学基金

1+阅读 · 2011年12月31日

基于机器学习的多义性数据概念标注研究

国家自然科学基金

4+阅读 · 2011年12月31日

Towards Arabic Sentence Simplification via Classification and Generative Approaches

Arxiv

0+阅读 · 2022年4月20日

Named Entity Recognition for Partially Annotated Datasets

Arxiv

0+阅读 · 2022年4月19日

XLM-E: Cross-lingual Language Model Pre-training via ELECTRA

Arxiv

0+阅读 · 2022年4月19日

Network Bandwidth Variation-Adapted State Transfer for Geo-Replicated State Machines and its Application to Dynamic Replica Replacement

Arxiv

0+阅读 · 2022年4月19日

Automated Data Augmentations for Graph Classification

Automated Data Augmentations for Graph Classification

Arxiv

1+阅读 · 2022年4月18日

Cross-Modal Coherence for Text-to-Image Retrieval

Arxiv

0+阅读 · 2022年4月15日

Accumulate: An identity-based blockchain protocol with cross-chain support, human-readable addresses, and key management capabilities

Arxiv

0+阅读 · 2022年4月15日

Adversarial Machine Learning in Image Classification: A Survey Towards the Defender's Perspective

Adversarial Machine Learning in Image Classification: A Survey Towards the Defender's Perspective

Arxiv

17+阅读 · 2020年9月8日

Self-supervised Learning: Generative or Contrastive

Arxiv

19+阅读 · 2020年7月21日

A Survey of Adversarial Learning on Graphs

Arxiv

38+阅读 · 2020年3月10日

阅读: 0 点赞: 0

小贴士

登录享主题订阅及个性化推荐

相关主题

热门VIP内容

开通专知VIP会员享更多权益服务

乌克兰太空研究（2022-2024年） | 176页

新型军用战斗机无人机（MFUAV’s）| 2025最新80页

国防领域人工智能走向何方？

无人机对士兵的心理影响

相关VIP内容

「图分类研究」最新2022综述

「图分类研究」最新2022综述

专知会员服务

99+阅读 · 2022年2月13日

【NeurIPS2021】由相似性迁移的弱样本细粒度分类

【NeurIPS2021】由相似性迁移的弱样本细粒度分类

专知会员服务

21+阅读 · 2021年11月11日

IEEE TPAMI | 基于标注偏差估计的实例相关PU学习

IEEE TPAMI | 基于标注偏差估计的实例相关PU学习

专知会员服务

12+阅读 · 2021年10月23日

文本情感分析方法研究综述

专知会员服务

56+阅读 · 2021年4月20日

【斯坦福CS330】终身学习: 问题陈述，前后迁移，30页ppt

【斯坦福CS330】终身学习: 问题陈述，前后迁移，30页ppt

专知会员服务

26+阅读 · 2020年12月13日

基于双注意力机制和迁移学习的跨领域推荐模型

专知会员服务

48+阅读 · 2020年10月20日

基于迁移学习的细粒度实体分类方法的研究

专知会员服务

32+阅读 · 2020年9月2日

【ACL2020】基于图神经网络的文本分类新方法

【ACL2020】基于图神经网络的文本分类新方法

专知会员服务

69+阅读 · 2020年7月12日

融合零样本学习和小样本学习的弱监督机器学习方法综述

专知会员服务

113+阅读 · 2020年3月20日

重庆大学发布最新《迁移自适应学习十年进展》综述论文，26页pdf333篇文献

重庆大学发布最新《迁移自适应学习十年进展》综述论文，26页pdf333篇文献

专知会员服务

43+阅读 · 2019年11月26日

相关资讯

迁移学习前沿探究探讨：低资源、领域泛化与安全迁移

迁移学习前沿探究探讨：低资源、领域泛化与安全迁移

大数据文摘

2+阅读 · 2022年4月13日

【NeurIPS2021】由相似性迁移的弱样本细粒度分类

【NeurIPS2021】由相似性迁移的弱样本细粒度分类

专知

0+阅读 · 2021年11月11日

IJCAI 2021 | 不确定性感知小样本图像分类模型，实现SOTA性能

IJCAI 2021 | 不确定性感知小样本图像分类模型，实现SOTA性能

微软研究院AI头条

0+阅读 · 2021年8月19日

文本情感分析方法研究综述

文本情感分析方法研究综述

专知

4+阅读 · 2021年4月20日

【综述】迁移自适应学习十年进展

【综述】迁移自适应学习十年进展

专知

41+阅读 · 2019年11月26日

赛尔笔记 | 自然语言处理中的迁移学习(上)

赛尔笔记 | 自然语言处理中的迁移学习(上)

哈工大SCIR

16+阅读 · 2019年10月17日

迁移自适应学习最新综述，附21页论文下载

迁移自适应学习最新综述，附21页论文下载

专知

34+阅读 · 2019年3月13日

NLP预训练模型大集合！

NLP预训练模型大集合！

全球人工智能

31+阅读 · 2018年12月29日

论文浅尝 | 基于局内去噪和迁移学习的关系抽取

论文浅尝 | 基于局内去噪和迁移学习的关系抽取

开放知识图谱

16+阅读 · 2018年12月2日

文本情感分析的预处理

文本情感分析的预处理

Datartisan数据工匠

17+阅读 · 2018年3月8日

相关基金

HDFS读、写性能概率建模与模型迁移方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向机器翻译的多词表达语义分析及应用研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于迁移学习的脑机接口特征提取和预测方法研究

国家自然科学基金

2+阅读 · 2013年12月31日

弱标注下基于主动学习的检测器适应问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于生成模型的迁移学习算法研究及其应用

国家自然科学基金

0+阅读 · 2012年12月31日

基于非平行超平面支持向量机的最优化模型及算法研究

国家自然科学基金

1+阅读 · 2012年12月31日

领域适应性问题相关学习算法与理论研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于跨域迁移学习的网络视频检索技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

面向文本分类的迁移学习和半监督学习方法研究

国家自然科学基金

1+阅读 · 2011年12月31日

基于机器学习的多义性数据概念标注研究

国家自然科学基金

4+阅读 · 2011年12月31日

相关论文

Towards Arabic Sentence Simplification via Classification and Generative Approaches

Arxiv

0+阅读 · 2022年4月20日

Named Entity Recognition for Partially Annotated Datasets

Arxiv

0+阅读 · 2022年4月19日

XLM-E: Cross-lingual Language Model Pre-training via ELECTRA

Arxiv

0+阅读 · 2022年4月19日

Network Bandwidth Variation-Adapted State Transfer for Geo-Replicated State Machines and its Application to Dynamic Replica Replacement

Arxiv

0+阅读 · 2022年4月19日

Automated Data Augmentations for Graph Classification

Automated Data Augmentations for Graph Classification

Arxiv

1+阅读 · 2022年4月18日

Cross-Modal Coherence for Text-to-Image Retrieval

Arxiv

0+阅读 · 2022年4月15日

Accumulate: An identity-based blockchain protocol with cross-chain support, human-readable addresses, and key management capabilities

Arxiv

0+阅读 · 2022年4月15日

Adversarial Machine Learning in Image Classification: A Survey Towards the Defender's Perspective

Adversarial Machine Learning in Image Classification: A Survey Towards the Defender's Perspective

Arxiv

17+阅读 · 2020年9月8日

Self-supervised Learning: Generative or Contrastive

Arxiv

19+阅读 · 2020年7月21日

A Survey of Adversarial Learning on Graphs

Arxiv

38+阅读 · 2020年3月10日

微信扫码咨询专知VIP会员