实现个性化和人文化 (Towards Personalized and Human-in-the-Loop Document Summarization) - 专知论文

会员服务 ·

0

INFORMS · Processing（编程语言） · Engineering · INTERACT · state-of-the-art ·

2021 年 10 月 1 日

Towards Personalized and Human-in-the-Loop Document Summarization

翻译：实现个性化和人文化

Samira Ghodratnama

The ubiquitous availability of computing devices and the widespread use of the internet have generated a large amount of data continuously. Therefore, the amount of available information on any given topic is far beyond humans' processing capacity to properly process, causing what is known as information overload. To efficiently cope with large amounts of information and generate content with significant value to users, we require identifying, merging and summarising information. Data summaries can help gather related information and collect it into a shorter format that enables answering complicated questions, gaining new insight and discovering conceptual boundaries. This thesis focuses on three main challenges to alleviate information overload using novel summarisation techniques. It further intends to facilitate the analysis of documents to support personalised information extraction. This thesis separates the research issues into four areas, covering (i) feature engineering in document summarisation, (ii) traditional static and inflexible summaries, (iii) traditional generic summarisation approaches, and (iv) the need for reference summaries. We propose novel approaches to tackle these challenges, by: i)enabling automatic intelligent feature engineering, ii) enabling flexible and interactive summarisation, iii) utilising intelligent and personalised summarisation approaches. The experimental results prove the efficiency of the proposed approaches compared to other state-of-the-art models. We further propose solutions to the information overload problem in different domains through summarisation, covering network traffic data, health data and business process data.

翻译：计算机设备无处不在的可用性和互联网的广泛使用不断产生大量数据。因此,关于任何特定主题的现有信息数量远远超过人类处理能力,远远超出了人类处理能力,无法正确处理,造成信息超载。为了高效率地处理大量信息,生成对用户具有重要价值的内容,我们需要确定、合并和总结信息。数据摘要可以帮助收集相关信息,并将其收集成一个较短的格式,从而能够回答复杂的问题,获得新的洞察力和发现概念界限。本论文侧重于利用新式合成技术减轻信息超载的三大挑战。它进一步打算便利分析文件以支持个人化信息提取。该论文将研究问题分为四个领域,包括:(一) 文件汇总的特征工程,(二) 传统的静态和不灵活摘要,(三) 传统的通用汇总方法,以及(四) 参考摘要。我们提出了应对这些挑战的新办法,其方法是:一) 增强自动智能特征工程,二) 使灵活和互动的合成方法得以实现。三) 将数据超载性化方法分为四个领域,包括:(一) 文件汇总的特征工程设计,我们提出的其他智能和个体化数据汇总。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

【论文推荐】文本摘要简述

【论文推荐】文本摘要简述

专知会员服务

69+阅读 · 2020年7月20日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【论文推荐】层次知识图谱，Hierarchical Knowledge Graphs: A Novel Information Representation for Exploratory Search Tasks

【论文推荐】层次知识图谱，Hierarchical Knowledge Graphs: A Novel Information Representation for Exploratory Search Tasks

专知会员服务

49+阅读 · 2020年5月26日

网络流量监测与分析大数据综述，A Survey on Big Data for Network Traffic Monitoring and Analysis

网络流量监测与分析大数据综述，A Survey on Big Data for Network Traffic Monitoring and Analysis

专知会员服务

65+阅读 · 2020年3月5日

【图机器学习论文】图摘要方法与应用综述（Graph Summarization Methods and Applications: A Survey）

【图机器学习论文】图摘要方法与应用综述（Graph Summarization Methods and Applications: A Survey）

专知会员服务

42+阅读 · 2019年12月16日

【ACL 2019 Tutorials】深度贝叶斯自然语言处理（Deep Bayesian Natural Language Processing），Jen-Tzung Chien

【ACL 2019 Tutorials】深度贝叶斯自然语言处理（Deep Bayesian Natural Language Processing），Jen-Tzung Chien

专知会员服务

48+阅读 · 2019年11月17日

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

专知会员服务

43+阅读 · 2019年11月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【文本摘要】Text Summarization文本摘要与注意力机制

【文本摘要】Text Summarization文本摘要与注意力机制

深度学习自然语言处理

9+阅读 · 2020年3月15日

人工智能 | ISAIR 2019诚邀稿件（推荐SCI期刊）

人工智能 | ISAIR 2019诚邀稿件（推荐SCI期刊）

Call4Papers

6+阅读 · 2019年4月1日

人工智能 | SCI期刊专刊信息3条

人工智能 | SCI期刊专刊信息3条

Call4Papers

5+阅读 · 2019年1月10日

清华大学NLP组整理的机器翻译论文阅读清单

清华大学NLP组整理的机器翻译论文阅读清单

AINLP

5+阅读 · 2018年12月29日

大数据 | 顶级SCI期刊专刊/国际会议信息7条

大数据 | 顶级SCI期刊专刊/国际会议信息7条

Call4Papers

10+阅读 · 2018年12月29日

计算机类 | ISCC 2019等国际会议信息9条

计算机类 | ISCC 2019等国际会议信息9条

Call4Papers

5+阅读 · 2018年12月25日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

人工智能 | 国际会议信息10条

人工智能 | 国际会议信息10条

Call4Papers

5+阅读 · 2018年12月18日

人工智能 | 国际会议/SCI期刊约稿信息9条

人工智能 | 国际会议/SCI期刊约稿信息9条

Call4Papers

3+阅读 · 2018年1月12日

NLP中自动生产文摘（auto text summarization）

NLP中自动生产文摘（auto text summarization）

数据挖掘入门与实战

4+阅读 · 2017年10月10日

Towards Generating Citation Sentences for Multiple References with Intent Control

Arxiv

0+阅读 · 2021年12月9日

Multi-document Summarization via Deep Learning Techniques: A Survey

Arxiv

0+阅读 · 2021年12月9日

CLIP-It! Language-Guided Video Summarization

Arxiv

0+阅读 · 2021年12月8日

Towards Natural Language Interfaces for Data Visualization: A Survey

Arxiv

9+阅读 · 2021年9月8日

What is Normal, What is Strange, and What is Missing in a Knowledge Graph: Unified Characterization via Inductive Summarization

Arxiv

8+阅读 · 2020年3月23日

Challenges in Building Intelligent Open-domain Dialog Systems

Arxiv

8+阅读 · 2019年10月22日

A User-Centered Concept Mining System for Query and Document Understanding at Tencent

Arxiv

6+阅读 · 2019年5月21日

Advances in Natural Language Question Answering: A Review

Advances in Natural Language Question Answering: A Review

Arxiv

5+阅读 · 2019年4月10日

Automatic Summarization of Natural Language

Arxiv

3+阅读 · 2018年12月18日

Graph Summarization: A Survey

Arxiv

5+阅读 · 2017年4月12日

VIP会员

文章信息

相关主题

Processing（编程语言）

state-of-the-art

相关VIP内容

【论文推荐】文本摘要简述

【论文推荐】文本摘要简述

专知会员服务

69+阅读 · 2020年7月20日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【论文推荐】层次知识图谱，Hierarchical Knowledge Graphs: A Novel Information Representation for Exploratory Search Tasks

【论文推荐】层次知识图谱，Hierarchical Knowledge Graphs: A Novel Information Representation for Exploratory Search Tasks

专知会员服务

49+阅读 · 2020年5月26日

网络流量监测与分析大数据综述，A Survey on Big Data for Network Traffic Monitoring and Analysis

网络流量监测与分析大数据综述，A Survey on Big Data for Network Traffic Monitoring and Analysis

专知会员服务

65+阅读 · 2020年3月5日

【图机器学习论文】图摘要方法与应用综述（Graph Summarization Methods and Applications: A Survey）

【图机器学习论文】图摘要方法与应用综述（Graph Summarization Methods and Applications: A Survey）

专知会员服务

42+阅读 · 2019年12月16日

【ACL 2019 Tutorials】深度贝叶斯自然语言处理（Deep Bayesian Natural Language Processing），Jen-Tzung Chien

【ACL 2019 Tutorials】深度贝叶斯自然语言处理（Deep Bayesian Natural Language Processing），Jen-Tzung Chien

专知会员服务

48+阅读 · 2019年11月17日

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

专知会员服务

43+阅读 · 2019年11月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

兵棋系统文档：联合战区级模拟-全球行动（JTLS-GO®）

【普林斯顿博士论文】面向人本机器人学的安全与学习博弈论融合

从无人机到数据：揭示边缘计算作为新作战域

综述：机器嗅觉与嵌入式人工智能正在塑造新的全球传感产业

相关资讯

【文本摘要】Text Summarization文本摘要与注意力机制

【文本摘要】Text Summarization文本摘要与注意力机制

深度学习自然语言处理

9+阅读 · 2020年3月15日

人工智能 | ISAIR 2019诚邀稿件（推荐SCI期刊）

人工智能 | ISAIR 2019诚邀稿件（推荐SCI期刊）

Call4Papers

6+阅读 · 2019年4月1日

人工智能 | SCI期刊专刊信息3条

人工智能 | SCI期刊专刊信息3条

Call4Papers

5+阅读 · 2019年1月10日

清华大学NLP组整理的机器翻译论文阅读清单

清华大学NLP组整理的机器翻译论文阅读清单

AINLP

5+阅读 · 2018年12月29日

大数据 | 顶级SCI期刊专刊/国际会议信息7条

大数据 | 顶级SCI期刊专刊/国际会议信息7条

Call4Papers

10+阅读 · 2018年12月29日

计算机类 | ISCC 2019等国际会议信息9条

计算机类 | ISCC 2019等国际会议信息9条

Call4Papers

5+阅读 · 2018年12月25日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

人工智能 | 国际会议信息10条

人工智能 | 国际会议信息10条

Call4Papers

5+阅读 · 2018年12月18日

人工智能 | 国际会议/SCI期刊约稿信息9条

人工智能 | 国际会议/SCI期刊约稿信息9条

Call4Papers

3+阅读 · 2018年1月12日

NLP中自动生产文摘（auto text summarization）

NLP中自动生产文摘（auto text summarization）

数据挖掘入门与实战

4+阅读 · 2017年10月10日

相关论文

Towards Generating Citation Sentences for Multiple References with Intent Control

Arxiv

0+阅读 · 2021年12月9日

Multi-document Summarization via Deep Learning Techniques: A Survey

Arxiv

0+阅读 · 2021年12月9日

CLIP-It! Language-Guided Video Summarization

Arxiv

0+阅读 · 2021年12月8日

Towards Natural Language Interfaces for Data Visualization: A Survey

Arxiv

9+阅读 · 2021年9月8日

What is Normal, What is Strange, and What is Missing in a Knowledge Graph: Unified Characterization via Inductive Summarization

Arxiv

8+阅读 · 2020年3月23日

Challenges in Building Intelligent Open-domain Dialog Systems

Arxiv

8+阅读 · 2019年10月22日

A User-Centered Concept Mining System for Query and Document Understanding at Tencent

Arxiv

6+阅读 · 2019年5月21日

Advances in Natural Language Question Answering: A Review

Advances in Natural Language Question Answering: A Review

Arxiv

5+阅读 · 2019年4月10日

Automatic Summarization of Natural Language

Arxiv

3+阅读 · 2018年12月18日

Graph Summarization: A Survey

Arxiv

5+阅读 · 2017年4月12日

微信扫码咨询专知VIP会员