校验以小于每字节一个指令校验 UTF-8 (Validating UTF-8 In Less Than One Instruction Per Byte) - 专知论文

会员服务 ·

0

Less · 数据库 ·

2020 年 10 月 10 日

Validating UTF-8 In Less Than One Instruction Per Byte

翻译：校验以小于每字节一个指令校验 UTF-8

John Keiser,Daniel Lemire

The majority of text is stored in UTF-8, which must be validated on ingestion. We present the lookup algorithm, which outperforms UTF-8 validation routines used in many libraries and languages by more than 10 times using commonly available SIMD instructions. To ensure reproducibility, our work is freely available as open source software.

翻译：大部分文本都储存在UTF-8, 必须在摄入时验证。我们展示了搜索算法,它比许多图书馆和语言中使用的UTF-8验证程序成功10倍以上,使用了通用的SIMD 指令。为了确保可复制性,我们的工作可以免费作为开放源代码软件提供。

0

相关内容

Less

LESS 是一个开源的样式语言，受到 Sass 的影响。严格来说，LESS 是一个嵌套的元语言，符合语法规范的 CSS 语句也是符合规范的 Less 代码。

商业数据分析，39页ppt

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

一份循环神经网络RNNs简明教程，37页ppt

一份循环神经网络RNNs简明教程，37页ppt

专知会员服务

173+阅读 · 2020年5月6日

【ACL 2019 Tutorials】话语分析及其应用（Discourse Analysis and Its Applications），Shafiq Joty，Giuseppe Carenini，Raymond Ng，Gabriel Murray

【ACL 2019 Tutorials】话语分析及其应用（Discourse Analysis and Its Applications），Shafiq Joty，Giuseppe Carenini，Raymond Ng，Gabriel Murray

专知会员服务

11+阅读 · 2019年11月16日

【O'Reilly AI Conference 2019】高管简报：从落后者到领导者-赢得AI竞赛（Executive Briefing: From laggard to leader—Winning the AI race），Anastasia Kouvela , Bharath Thota

【O'Reilly AI Conference 2019】高管简报：从落后者到领导者-赢得AI竞赛（Executive Briefing: From laggard to leader—Winning the AI race），Anastasia Kouvela , Bharath Thota

专知会员服务

8+阅读 · 2019年11月5日

【O'Reilly AI Conference 2019】当飞行比停下便宜时，When flying is cheaper than standing still，苏黎世联邦理工学院Raffaello D'Andrea教授

【O'Reilly AI Conference 2019】当飞行比停下便宜时，When flying is cheaper than standing still，苏黎世联邦理工学院Raffaello D'Andrea教授

专知会员服务

15+阅读 · 2019年11月5日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

已删除

德先生

53+阅读 · 2019年4月28日

Approximate Cross-Validation for Structured Models

Approximate Cross-Validation for Structured Models

Arxiv

0+阅读 · 2020年12月1日

Modularising Verification Of Durable Opacity

Arxiv

0+阅读 · 2020年11月30日

Debug-Localize-Repair: A Symbiotic Construction for Heap Manipulations

Arxiv

0+阅读 · 2020年11月26日

All Word Embeddings from One Embedding

Arxiv

4+阅读 · 2020年5月25日

Span Based Open Information Extraction

Arxiv

3+阅读 · 2019年3月1日

How do you correct run-on sentences it's not as easy as it seems

Arxiv

4+阅读 · 2018年9月21日

VizWiz Grand Challenge: Answering Visual Questions from Blind People

Arxiv

3+阅读 · 2018年4月2日

What is Wrong with Topic Modeling? (and How to Fix it Using Search-based Software Engineering)

Arxiv

3+阅读 · 2018年2月20日

Word Translation Without Parallel Data

Arxiv

8+阅读 · 2018年1月30日

DVQA: Understanding Data Visualizations via Question Answering

Arxiv

8+阅读 · 2018年1月24日

VIP会员

文章信息

相关主题

相关VIP内容

商业数据分析，39页ppt

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

一份循环神经网络RNNs简明教程，37页ppt

一份循环神经网络RNNs简明教程，37页ppt

专知会员服务

173+阅读 · 2020年5月6日

【ACL 2019 Tutorials】话语分析及其应用（Discourse Analysis and Its Applications），Shafiq Joty，Giuseppe Carenini，Raymond Ng，Gabriel Murray

【ACL 2019 Tutorials】话语分析及其应用（Discourse Analysis and Its Applications），Shafiq Joty，Giuseppe Carenini，Raymond Ng，Gabriel Murray

专知会员服务

11+阅读 · 2019年11月16日

【O'Reilly AI Conference 2019】高管简报：从落后者到领导者-赢得AI竞赛（Executive Briefing: From laggard to leader—Winning the AI race），Anastasia Kouvela , Bharath Thota

【O'Reilly AI Conference 2019】高管简报：从落后者到领导者-赢得AI竞赛（Executive Briefing: From laggard to leader—Winning the AI race），Anastasia Kouvela , Bharath Thota

专知会员服务

8+阅读 · 2019年11月5日

【O'Reilly AI Conference 2019】当飞行比停下便宜时，When flying is cheaper than standing still，苏黎世联邦理工学院Raffaello D'Andrea教授

【O'Reilly AI Conference 2019】当飞行比停下便宜时，When flying is cheaper than standing still，苏黎世联邦理工学院Raffaello D'Andrea教授

专知会员服务

15+阅读 · 2019年11月5日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

视觉-语言-动作模型解析：从模块构成到里程碑与挑战

《解析陆域作战方向：一个概念性框架》报告

【博士论文】基于多模态基础模型的上下文学习

追寻真正的AI自主性：从遗留思维到战场优势

相关资讯

已删除

德先生

53+阅读 · 2019年4月28日

相关论文

Approximate Cross-Validation for Structured Models

Approximate Cross-Validation for Structured Models

Arxiv

0+阅读 · 2020年12月1日

Modularising Verification Of Durable Opacity

Arxiv

0+阅读 · 2020年11月30日

Debug-Localize-Repair: A Symbiotic Construction for Heap Manipulations

Arxiv

0+阅读 · 2020年11月26日

All Word Embeddings from One Embedding

Arxiv

4+阅读 · 2020年5月25日

Span Based Open Information Extraction

Arxiv

3+阅读 · 2019年3月1日

How do you correct run-on sentences it's not as easy as it seems

Arxiv

4+阅读 · 2018年9月21日

VizWiz Grand Challenge: Answering Visual Questions from Blind People

Arxiv

3+阅读 · 2018年4月2日

What is Wrong with Topic Modeling? (and How to Fix it Using Search-based Software Engineering)

Arxiv

3+阅读 · 2018年2月20日

Word Translation Without Parallel Data

Arxiv

8+阅读 · 2018年1月30日

DVQA: Understanding Data Visualizations via Question Answering

Arxiv

8+阅读 · 2018年1月24日

微信扫码咨询专知VIP会员