Shell 语言处理: 用于机器学习的 Unix 命令解析 (Shell Language Processing: Unix command parsing for Machine Learning) - 专知论文

会员服务 ·

0

Shell · Learning · Unix · Processing（编程语言） · Machine Learning ·

2022 年 7 月 7 日

Shell Language Processing: Unix command parsing for Machine Learning

翻译：Shell 语言处理: 用于机器学习的 Unix 命令解析

Dmitrijs Trizna

from arxiv, 4 pages, 1 table

In this article, we present a Shell Language Preprocessing (SLP) library, which implements tokenization and encoding directed at parsing Unix and Linux shell commands. We describe the rationale behind the need for a new approach with specific examples of when conventional Natural Language Processing (NLP) pipelines fail. Furthermore, we evaluate our methodology on a security classification task against widely accepted information and communications technology (ICT) tokenization techniques and achieve significant improvement of an F1 score from 0.392 to 0.874.

翻译：在文章中,我们提出一个壳牌语言预处理(SLP)图书馆,该图书馆针对解析Unix和Linux shell命令,实施象征性和编码,我们描述需要采用新办法的理由,具体举例说明常规的自然语言处理(NLP)管道在何时失效;此外,我们对照广泛接受的信息和通信技术(ICT)代用技术,评价我们的安全分类任务方法,并大大改进了F1分,从0.392到0.874分。

0

相关内容

Shell

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

108+阅读 · 2020年5月1日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

SF6断路器开断过程中灭弧室内动态温度场的测量与特性研究

国家自然科学基金

0+阅读 · 2014年12月31日

有序介孔碳薄膜中碳化钨的定向生长及其电催化性能

国家自然科学基金

0+阅读 · 2012年12月31日

石榴石相LuAG:Ce(Pr)闪烁晶体的缺陷控制和性能优化

国家自然科学基金

0+阅读 · 2012年12月31日

自组装太阳能电池（二）

国家自然科学基金

0+阅读 · 2012年12月31日

高效染料敏化太阳能电池导电聚合物/碳复合对电极的制备及电催化机理研究

国家自然科学基金

0+阅读 · 2010年12月31日

A preprocessing perspective for quantum machine learning classification advantage using NISQ algorithms

Arxiv

0+阅读 · 2022年8月28日

Living-off-the-Land Abuse Detection Using Natural Language Processing and Supervised Learning

Arxiv

0+阅读 · 2022年8月26日

Deep Learning-based approaches for automatic detection of shell nouns and evaluation on WikiText-2

Arxiv

0+阅读 · 2022年8月25日

Graph Neural Networks for Natural Language Processing: A Survey

Arxiv

36+阅读 · 2021年6月10日

A Survey on Deep Learning for Named Entity Recognition

A Survey on Deep Learning for Named Entity Recognition

Arxiv

26+阅读 · 2020年3月13日

VIP会员

文章信息

相关主题

Processing（编程语言）

Machine Learning

相关VIP内容

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

108+阅读 · 2020年5月1日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《利用人工智能对军事行动进行建模》

《利用人工智能学习、优化与推演美国海军作战部队的战略布局与分散（续文）》

机器人、无人机与实时影像：应对城市爆炸威胁的三大技术方案

《指挥官意图消息中关键概念自动提取》最新47页

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

相关论文

A preprocessing perspective for quantum machine learning classification advantage using NISQ algorithms

Arxiv

0+阅读 · 2022年8月28日

Living-off-the-Land Abuse Detection Using Natural Language Processing and Supervised Learning

Arxiv

0+阅读 · 2022年8月26日

Deep Learning-based approaches for automatic detection of shell nouns and evaluation on WikiText-2

Arxiv

0+阅读 · 2022年8月25日

Graph Neural Networks for Natural Language Processing: A Survey

Arxiv

36+阅读 · 2021年6月10日

A Survey on Deep Learning for Named Entity Recognition

A Survey on Deep Learning for Named Entity Recognition

Arxiv

26+阅读 · 2020年3月13日

相关基金

SF6断路器开断过程中灭弧室内动态温度场的测量与特性研究

国家自然科学基金

0+阅读 · 2014年12月31日

有序介孔碳薄膜中碳化钨的定向生长及其电催化性能

国家自然科学基金

0+阅读 · 2012年12月31日

石榴石相LuAG:Ce(Pr)闪烁晶体的缺陷控制和性能优化

国家自然科学基金

0+阅读 · 2012年12月31日

自组装太阳能电池（二）

国家自然科学基金

0+阅读 · 2012年12月31日

高效染料敏化太阳能电池导电聚合物/碳复合对电极的制备及电催化机理研究

国家自然科学基金

0+阅读 · 2010年12月31日

微信扫码咨询专知VIP会员