新闻中的大流行性传播:使用专题模式的文本分析 (How Pandemic Spread in News: Text Analysis Using Topic Model) - 专知论文

会员服务 ·

0

话题模型 · MoDELS · LDA · COVID-19 · 话题 ·

2021 年 2 月 5 日

How Pandemic Spread in News: Text Analysis Using Topic Model

翻译：新闻中的大流行性传播:使用专题模式的文本分析

Minghao Wang,Paolo Mengoni

COVID-19 pandemic has made tremendous impact on the whole world, both the real world and the media atmosphere. Our research conducted a text analysis using LDA topic model. We first scraped 1127 articles and 5563 comments on SCMP covering COVID-19 from Jan 20 to May 19, then we trained the LDA model and tuned parameters based on the $C_v$ coherence as the model evaluation method. With the optimal model, dominant topics, representative documents of each topic and the inconsistency between articles and comments are analyzed. Some factors of the inconsistency are discussed at last.

翻译：我们的研究利用LDA专题模型进行了文本分析,我们首先从1月20日至5月19日对涉及COVID-19的SCMP的1127篇文章和5563条评论进行了剪辑,然后我们根据美元和五美元的连贯性对LDA模型和调制参数进行了培训,作为评价模式,分析了最佳模型、主要专题、每个专题的代表性文件以及文章和评论之间的不一致之处,最后讨论了不一致的一些因素。

0

相关内容

话题模型

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

专知会员服务

43+阅读 · 2019年11月25日

【O'Reilly AI Conference 2019】应用NLP的医疗科技：功能工程与模型诊断（NLP for healthcare: Feature engineering and model diagnostics），美国医疗保健公司Episource，Manas Ranjan Kar

【O'Reilly AI Conference 2019】应用NLP的医疗科技：功能工程与模型诊断（NLP for healthcare: Feature engineering and model diagnostics），美国医疗保健公司Episource，Manas Ranjan Kar

专知会员服务

8+阅读 · 2019年11月6日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

已删除

将门创投

6+阅读 · 2019年9月3日

Linguistically Regularized LSTMs for Sentiment Classification

Linguistically Regularized LSTMs for Sentiment Classification

黑龙江大学自然语言处理实验室

8+阅读 · 2018年5月4日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Properties of Inconsistency Measures for Databases

Arxiv

0+阅读 · 2021年3月30日

Text Summarization with Pretrained Encoders

Arxiv

5+阅读 · 2019年8月22日

Evolutionary Data Measures: Understanding the Difficulty of Text Classification Tasks

Evolutionary Data Measures: Understanding the Difficulty of Text Classification Tasks

Arxiv

4+阅读 · 2018年11月5日

Topic Modelling of Empirical Text Corpora: Validity, Reliability, and Reproducibility in Comparison to Semantic Maps

Arxiv

4+阅读 · 2018年6月4日

Multi-hop Inference for Sentence-level TextGraphs: How Challenging is Meaningfully Combining Information for Science Question Answering?

Arxiv

3+阅读 · 2018年5月29日

Dynamic and Static Topic Model for Analyzing Time-Series Document Collections

Arxiv

8+阅读 · 2018年5月6日

Open Information Extraction on Scientific Text: An Evaluation

Arxiv

6+阅读 · 2018年2月15日

Learning Tree-based Deep Model for Recommender Systems

Arxiv

7+阅读 · 2018年1月8日

Analysis of Wikipedia-based Corpora for Question Answering

Arxiv

7+阅读 · 2018年1月6日

Multilingual Topic Models

Arxiv

3+阅读 · 2017年12月18日

VIP会员

文章信息

相关主题

相关VIP内容

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

【NLP模型的跨语言/跨领域迁移】《Transferring NLP models across languages and domains》

专知会员服务

43+阅读 · 2019年11月25日

【O'Reilly AI Conference 2019】应用NLP的医疗科技：功能工程与模型诊断（NLP for healthcare: Feature engineering and model diagnostics），美国医疗保健公司Episource，Manas Ranjan Kar

【O'Reilly AI Conference 2019】应用NLP的医疗科技：功能工程与模型诊断（NLP for healthcare: Feature engineering and model diagnostics），美国医疗保健公司Episource，Manas Ranjan Kar

专知会员服务

8+阅读 · 2019年11月6日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

《理解城市战及其在俄乌战争中的表现》报告

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

《建设式兵棋模拟作为战术集群配置优化的关键组成部分》

相关资讯

已删除

将门创投

6+阅读 · 2019年9月3日

Linguistically Regularized LSTMs for Sentiment Classification

Linguistically Regularized LSTMs for Sentiment Classification

黑龙江大学自然语言处理实验室

8+阅读 · 2018年5月4日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

相关论文

Properties of Inconsistency Measures for Databases

Arxiv

0+阅读 · 2021年3月30日

Text Summarization with Pretrained Encoders

Arxiv

5+阅读 · 2019年8月22日

Evolutionary Data Measures: Understanding the Difficulty of Text Classification Tasks

Evolutionary Data Measures: Understanding the Difficulty of Text Classification Tasks

Arxiv

4+阅读 · 2018年11月5日

Topic Modelling of Empirical Text Corpora: Validity, Reliability, and Reproducibility in Comparison to Semantic Maps

Arxiv

4+阅读 · 2018年6月4日

Multi-hop Inference for Sentence-level TextGraphs: How Challenging is Meaningfully Combining Information for Science Question Answering?

Arxiv

3+阅读 · 2018年5月29日

Dynamic and Static Topic Model for Analyzing Time-Series Document Collections

Arxiv

8+阅读 · 2018年5月6日

Open Information Extraction on Scientific Text: An Evaluation

Arxiv

6+阅读 · 2018年2月15日

Learning Tree-based Deep Model for Recommender Systems

Arxiv

7+阅读 · 2018年1月8日

Analysis of Wikipedia-based Corpora for Question Answering

Arxiv

7+阅读 · 2018年1月6日

Multilingual Topic Models

Arxiv

3+阅读 · 2017年12月18日

微信扫码咨询专知VIP会员