法文文件复杂程度量化 (Quantifying French Document Complexity) - 专知论文

会员服务 ·

0

Performer · Learning · contrastive · 值域 · 相似度 ·

2022 年 8 月 27 日

Quantifying French Document Complexity

翻译：法文文件复杂程度量化

Vincent Primpied,David Beauchemin,Richard Khoury

from arxiv, Accepted in CAIA 2022

Measuring a document's complexity level is an open challenge, particularly when one is working on a diverse corpus of documents rather than comparing several documents on a similar topic or working on a language other than English. In this paper, we define a methodology to measure the complexity of French documents, using a new general and diversified corpus of texts, the "French Canadian complexity level corpus", and a wide range of metrics. We compare different learning algorithms to this task and contrast their performances and their observations on which characteristics of the texts are more significant to their complexity. Our results show that our methodology gives a general-purpose measurement of text complexity in French.

翻译：衡量文件的复杂程度是一个公开的挑战,特别是当人们正在编制各种文件,而不是比较关于类似主题的若干文件,或使用英文以外的其他语文时,我们便会使用新的一般和多样化的文本、“法属加拿大复杂程度”和一系列广泛的衡量标准,确定衡量法文文件复杂性的方法。我们比较了不同的学习算法和这项任务,比较了它们的业绩和对哪些文本的特征对其复杂性更为重要的看法。我们的结果显示,我们的方法提供了法文文本复杂性的通用计量方法。

0

相关内容

Performer

自然语言处理顶会NAACL2022最佳论文出炉！

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

多源卫星遥感反演气溶胶光学特性研究

国家自然科学基金

0+阅读 · 2014年12月31日

温度对高量子效率光阴极影响研究

国家自然科学基金

0+阅读 · 2013年12月31日

ADS中检测快中子束的GEM探测器的研制

国家自然科学基金

0+阅读 · 2013年12月31日

HIV-1 Nef蛋白促进KSHV K1诱导血管和肿瘤形成：信号通路与miRNAs的作用

国家自然科学基金

0+阅读 · 2012年12月31日

miRNAs和信号通路在艾滋病毒Vpr蛋白调控卡波氏肉瘤病毒潜伏感染中的作用和意义

国家自然科学基金

0+阅读 · 2011年12月31日

The Power of Selecting Key Blocks with Local Pre-ranking for Long Document Information Retrieval

Arxiv

0+阅读 · 2022年10月15日

How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels

Arxiv

0+阅读 · 2022年10月14日

The Complexity of NISQ

The Complexity of NISQ

Arxiv

0+阅读 · 2022年10月13日

One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning

Arxiv

0+阅读 · 2022年10月13日

Active Exploration for Inverse Reinforcement Learning

Arxiv

0+阅读 · 2022年10月12日

VIP会员

文章信息

相关主题

相关VIP内容

自然语言处理顶会NAACL2022最佳论文出炉！

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《基于AI的动态任务分配策略实现多智能体系统有意义人类控制》报告

《超越连接：AI驱动网络未来愿景》最新报告

人工智能赋能多域作战：能力与挑战

《战场空间决策优势：AI基础与应用研究》总结报告

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

相关论文

The Power of Selecting Key Blocks with Local Pre-ranking for Long Document Information Retrieval

Arxiv

0+阅读 · 2022年10月15日

How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels

Arxiv

0+阅读 · 2022年10月14日

The Complexity of NISQ

The Complexity of NISQ

Arxiv

0+阅读 · 2022年10月13日

One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning

Arxiv

0+阅读 · 2022年10月13日

Active Exploration for Inverse Reinforcement Learning

Arxiv

0+阅读 · 2022年10月12日

相关基金

多源卫星遥感反演气溶胶光学特性研究

国家自然科学基金

0+阅读 · 2014年12月31日

温度对高量子效率光阴极影响研究

国家自然科学基金

0+阅读 · 2013年12月31日

ADS中检测快中子束的GEM探测器的研制

国家自然科学基金

0+阅读 · 2013年12月31日

HIV-1 Nef蛋白促进KSHV K1诱导血管和肿瘤形成：信号通路与miRNAs的作用

国家自然科学基金

0+阅读 · 2012年12月31日

miRNAs和信号通路在艾滋病毒Vpr蛋白调控卡波氏肉瘤病毒潜伏感染中的作用和意义

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员