Instagram社会网络员额的自动汇总</s> (Automatic summarisation of Instagram social network posts Combining semantic and statistical approaches)

The proliferation of data and text documents such as articles, web pages, books, social network posts, etc. on the Internet has created a fundamental challenge in various fields of text processing under the title of "automatic text summarisation". Manual processing and summarisation of large volumes of textual data is a very difficult, expensive, time-consuming and impossible process for human users. Text summarisation systems are divided into extractive and abstract categories. In the extractive summarisation method, the final summary of a text document is extracted from the important sentences of the same document without any modification. In this method, it is possible to repeat a series of sentences and to interfere with pronouns. However, in the abstract summarisation method, the final summary of a textual document is extracted from the meaning and significance of the sentences and words of the same document or other documents. Many of the works carried out have used extraction methods or abstracts to summarise the collection of web documents, each of which has advantages and disadvantages in the results obtained in terms of similarity or size. In this work, a crawler has been developed to extract popular text posts from the Instagram social network with appropriate preprocessing, and a set of extraction and abstraction algorithms have been combined to show how each of the abstraction algorithms can be used. Observations made on 820 popular text posts on the social network Instagram show the accuracy (80%) of the proposed system.

翻译：互联网上诸如文章、网页、书籍、社交网络文章等数据和文本文件的扩散,在“自动文本摘要”的标题下,给文本处理的各个领域带来了一个根本性的挑战。对大量文本数据进行手工处理和汇总对于人类用户来说是一个非常困难、昂贵、耗时和不可能的过程。文本摘要系统分为采掘和抽象的类别。在抽取归纳方法中,文本文件的最后摘要取自同一文件的重要句子,而不作任何修改。在这种方法中,有可能重复一系列的句子并干扰预言。但是,在抽象总结方法中,文本文件的最后摘要摘自同一文件或其他文件的句子和文字的含义和意义。许多完成的著作使用了提取方法或摘要来对网络文件的收集进行总结,其中每一种文件都具有相似性或大小方面的结果的利弊。在这个方法中,已经开发了一个检索器,从Instagram Social 20 中从Instital commex production中提取了一批受欢迎的文本,而每套Mismalmagraphical 20 都用了一套社会算法的缩算图,并用了一套图前和一套图。</s>

相关内容

Networking

关注 22

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

Python计算导论，560页pdf，Introduction to Computing Using Python

专知会员服务

76+阅读 · 2020年5月5日

专知会员服务

59+阅读 · 2019年11月24日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019