Translated Title: 大型语言模型可评估新闻媒体的可信度 (Large language models can rate news outlet credibility) - 专知论文

会员服务 ·

0

大型语言模型 · 新闻 · 语言模型 · ChatGPT · 情境 ·

2023 年 4 月 1 日

Large language models can rate news outlet credibility

翻译：Translated Title: 大型语言模型可评估新闻媒体的可信度

Kai-Cheng Yang,Filippo Menczer

from arxiv, 10 pages, 3 figures

Although large language models (LLMs) have shown exceptional performance in various natural language processing tasks, they are prone to hallucinations. State-of-the-art chatbots, such as the new Bing, attempt to mitigate this issue by gathering information directly from the internet to ground their answers. In this setting, the capacity to distinguish trustworthy sources is critical for providing appropriate accuracy contexts to users. Here we assess whether ChatGPT, a prominent LLM, can evaluate the credibility of news outlets. With appropriate instructions, ChatGPT can provide ratings for a diverse set of news outlets, including those in non-English languages and satirical sources, along with contextual explanations. Our results show that these ratings correlate with those from human experts (Spearmam's $\rho=0.54, p<0.001$). These findings suggest that LLMs could be an affordable reference for credibility ratings in fact-checking applications. Future LLMs should enhance their alignment with human expert judgments of source credibility to improve information accuracy.

翻译：Translated Abstract: 尽管大型语言模型（LLM）在各种自然语言处理任务中表现出了卓越的性能，但它们容易出现幻觉。最新的聊天机器人，如新版Bing，尝试通过直接从互联网收集信息来确定其答案。在这种情况下，区分值得信赖的来源对于向用户提供合适的准确性情境至关重要。在这里，我们评估了ChatGPT，一个著名的LLM，能否评估新闻媒体的可信度。通过适当的说明，ChatGPT可以为各种新闻媒体提供评级，包括非英语语言和讽刺性来源以及上下文解释。我们的结果表明，这些评级与人类专家的评级相关（$Spearmam's \ rho =0.54，p <0.001$）。这些发现表明LLM可以成为事实核查应用程序中可信度评级的经济参考。未来的LLM应增强与人类专家判断来源可信度的协调性，以提高信息准确性。

0

相关内容

大型语言模型

大型语言模型

【吴恩达新课程】ChatGPT提示工程，ChatGPT Prompt Engineering for Developers

【吴恩达新课程】ChatGPT提示工程，ChatGPT Prompt Engineering for Developers

专知会员服务

104+阅读 · 2023年4月28日

评估ChatGPT的信息提取能力:对性能、可解释性、校准和忠实度的评估

评估ChatGPT的信息提取能力:对性能、可解释性、校准和忠实度的评估

专知会员服务

74+阅读 · 2023年4月26日

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

【ChatGPT系列报告】GPT-4及ChatGPT相关应用梳理，33页ppt

【ChatGPT系列报告】GPT-4及ChatGPT相关应用梳理，33页ppt

专知会员服务

327+阅读 · 2023年3月19日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

使用BERT做文本摘要

使用BERT做文本摘要

专知

23+阅读 · 2019年12月7日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

泡泡机器人SLAM

11+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新五篇视觉问答相关论文—深度学习评价、交互注意融合、VizWiz、引导注意力、

【论文推荐】最新五篇视觉问答相关论文—深度学习评价、交互注意融合、VizWiz、引导注意力、

专知

10+阅读 · 2018年6月8日

【论文推荐】最新八篇生成对抗网络相关论文—条件翻译、RGB-D动作识别、量子生成对抗网络、语义对齐、视频摘要、视觉-文本注意力

【论文推荐】最新八篇生成对抗网络相关论文—条件翻译、RGB-D动作识别、量子生成对抗网络、语义对齐、视频摘要、视觉-文本注意力

专知

15+阅读 · 2018年5月15日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

LncRNA介导肿瘤相关巨噬细胞促进乳腺癌转移分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

APOBEC3s与维吾尔族妇女宫颈癌发生发展的相关性研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于多维度文本特征的社区问答答案质量评估研究

国家自然科学基金

0+阅读 · 2013年12月31日

miR-207促进肝癌侵袭与转移的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

竞价排名对搜索引擎公正性和效率的影响

国家自然科学基金

0+阅读 · 2013年12月31日

抵抗素诱导猪脂肪异位沉积中miR-34a的调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

ADAMTS8在结直肠癌中的抑癌作用及其负调控MAPK/ERK通路的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

miR-21及miR-214双向调节乳腺癌细胞骨转移的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

microRNA调节肿瘤抑制因子Caliban应答DNA损伤的机制

国家自然科学基金

1+阅读 · 2012年12月31日

三阴性乳腺癌中ppar-gamma去甲基化研究及其临床意义

国家自然科学基金

0+阅读 · 2009年12月31日

Measuring and Mitigating Constraint Violations of In-Context Learning for Utterance-to-API Semantic Parsing

Arxiv

0+阅读 · 2023年5月24日

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

Arxiv

0+阅读 · 2023年5月24日

Active Prompting with Chain-of-Thought for Large Language Models

Arxiv

0+阅读 · 2023年5月23日

Evaluating Factual Consistency of Summaries with Large Language Models

Arxiv

0+阅读 · 2023年5月23日

Flexible Grammar-Based Constrained Decoding for Language Models

Arxiv

0+阅读 · 2023年5月23日

Evaluating and Enhancing Structural Understanding Capabilities of Large Language Models on Tables via Input Designs

Arxiv

0+阅读 · 2023年5月22日

Album Storytelling with Iterative Story-aware Captioning and Large Language Models

Arxiv

0+阅读 · 2023年5月22日

Automatic Code Summarization via ChatGPT: How Far Are We?

Arxiv

0+阅读 · 2023年5月22日

Reducing Sequence Length by Predicting Edit Operations with Large Language Models

Arxiv

0+阅读 · 2023年5月19日

Comparing Software Developers with ChatGPT: An Empirical Investigation

Arxiv

0+阅读 · 2023年5月19日

VIP会员

文章信息

相关主题

大型语言模型

相关VIP内容

【吴恩达新课程】ChatGPT提示工程，ChatGPT Prompt Engineering for Developers

【吴恩达新课程】ChatGPT提示工程，ChatGPT Prompt Engineering for Developers

专知会员服务

104+阅读 · 2023年4月28日

评估ChatGPT的信息提取能力:对性能、可解释性、校准和忠实度的评估

评估ChatGPT的信息提取能力:对性能、可解释性、校准和忠实度的评估

专知会员服务

74+阅读 · 2023年4月26日

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

【ChatGPT系列报告】GPT-4及ChatGPT相关应用梳理，33页ppt

【ChatGPT系列报告】GPT-4及ChatGPT相关应用梳理，33页ppt

专知会员服务

327+阅读 · 2023年3月19日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《军事域人工智能风险、机遇与治理战略指导报告》2025最新76页报告

《杀伤网与精确规模：智能饱和战争时代的战略要务-印度视角》2025最新报告

俄乌冲突的地缘政治与军事教训（万字长文）

《弹药快速效能建模：推进互操作性与技术优势》2025最新26页报告

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

使用BERT做文本摘要

使用BERT做文本摘要

专知

23+阅读 · 2019年12月7日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

泡泡机器人SLAM

11+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新五篇视觉问答相关论文—深度学习评价、交互注意融合、VizWiz、引导注意力、

【论文推荐】最新五篇视觉问答相关论文—深度学习评价、交互注意融合、VizWiz、引导注意力、

专知

10+阅读 · 2018年6月8日

【论文推荐】最新八篇生成对抗网络相关论文—条件翻译、RGB-D动作识别、量子生成对抗网络、语义对齐、视频摘要、视觉-文本注意力

【论文推荐】最新八篇生成对抗网络相关论文—条件翻译、RGB-D动作识别、量子生成对抗网络、语义对齐、视频摘要、视觉-文本注意力

专知

15+阅读 · 2018年5月15日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

相关论文

Measuring and Mitigating Constraint Violations of In-Context Learning for Utterance-to-API Semantic Parsing

Arxiv

0+阅读 · 2023年5月24日

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

Arxiv

0+阅读 · 2023年5月24日

Active Prompting with Chain-of-Thought for Large Language Models

Arxiv

0+阅读 · 2023年5月23日

Evaluating Factual Consistency of Summaries with Large Language Models

Arxiv

0+阅读 · 2023年5月23日

Flexible Grammar-Based Constrained Decoding for Language Models

Arxiv

0+阅读 · 2023年5月23日

Evaluating and Enhancing Structural Understanding Capabilities of Large Language Models on Tables via Input Designs

Arxiv

0+阅读 · 2023年5月22日

Album Storytelling with Iterative Story-aware Captioning and Large Language Models

Arxiv

0+阅读 · 2023年5月22日

Automatic Code Summarization via ChatGPT: How Far Are We?

Arxiv

0+阅读 · 2023年5月22日

Reducing Sequence Length by Predicting Edit Operations with Large Language Models

Arxiv

0+阅读 · 2023年5月19日

Comparing Software Developers with ChatGPT: An Empirical Investigation

Arxiv

0+阅读 · 2023年5月19日

相关基金

LncRNA介导肿瘤相关巨噬细胞促进乳腺癌转移分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

APOBEC3s与维吾尔族妇女宫颈癌发生发展的相关性研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于多维度文本特征的社区问答答案质量评估研究

国家自然科学基金

0+阅读 · 2013年12月31日

miR-207促进肝癌侵袭与转移的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

竞价排名对搜索引擎公正性和效率的影响

国家自然科学基金

0+阅读 · 2013年12月31日

抵抗素诱导猪脂肪异位沉积中miR-34a的调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

ADAMTS8在结直肠癌中的抑癌作用及其负调控MAPK/ERK通路的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

miR-21及miR-214双向调节乳腺癌细胞骨转移的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

microRNA调节肿瘤抑制因子Caliban应答DNA损伤的机制

国家自然科学基金

1+阅读 · 2012年12月31日

三阴性乳腺癌中ppar-gamma去甲基化研究及其临床意义

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员