SemParser: 用于日志分析的语义解析器 (SemParser: A Semantic Parser for Log Analysis) - 专知论文

会员服务 ·

0

INFORMS · MINE · 可辨认的 · Performer · Seven ·

2021 年 12 月 25 日

SemParser: A Semantic Parser for Log Analysis

翻译：SemParser: 用于日志分析的语义解析器

Yintong Huo,Yuxin Su,Baitong Li,Michael R. Lyu

Logs, being run-time information automatically generated by software, record system events and activities with their timestamps. Before obtaining more insights about the run-time status of the software, a fundamental step of log analysis, called log parsing, is employed to extract structured templates and parameters from the semi-structured raw log messages. However, current log parsers regard each message as a character string, ignoring the semantic information included in parameters and templates. Thus, we propose the semantic parser SemParser to unlock the critical bottleneck of mining semantics from log messages. It contains two steps, an end-to-end semantic miner and a joint parser. Specifically, the first step aims to identify explicit semantics inside a single log, and the second step is responsible for jointly inferring implicit semantics and computing structural outputs based on the contextual knowledge base. To analyze the effectiveness of our semantic parser, we first demonstrate that it can derive rich semantics from log messages collected from seven widely-applied systems with an average F1 score of 0.987. Then, we conduct two representative downstream tasks, showing that current downstream techniques improve their performance with appropriately extracted semantics by 11.7% and 8.65% in anomaly detection and failure diagnosis tasks, respectively. We believe these findings provide insights into semantically understanding log messages for the log analysis community.

翻译：日志, 由软件自动生成运行时间信息, 记录系统事件和活动及其时间戳自动生成。在获得更多关于软件运行时间状态的深入了解之前, 日志分析的基本步骤, 称为日志分析, 用于从半结构原始日志信息中提取结构化模板和参数。然而, 当前日志分析者将每条信息视为字符串, 忽略参数和模板中包含的语义信息。因此, 我们建议语义分析器 SemParser 从日志信息中解开采矿语义学的关键瓶颈。它包含两个步骤, 包括一个从终端到终端的语义挖掘器和一个联合剖析器。具体地说, 第一步旨在从一个半结构化的原始日志中找出明确的语义模板和参数。第二步是共同推断隐含语义的字符串, 忽略参数和模板中包含的语义信息。因此, 我们首先证明它可以从7个广泛应用的系统收集的日志信息中获取丰富的语义学内容。它包含平均的 F1 至终端的语义挖掘探测器和一个联合读取器。。。。第一步, 我们进行两个具有代表性的路径分析任务, 分析结果分析, 分析, 分析分析分析分析结果分析分析分析分析分析分析分析

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

上百种预训练中文词向量：Chinese-Word-Vectors

上百种预训练中文词向量：Chinese-Word-Vectors

AINLP

23+阅读 · 2019年2月26日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

随机辛算法和多辛算法

国家自然科学基金

2+阅读 · 2014年12月31日

线性算子的谱结构及其扰动分析

国家自然科学基金

0+阅读 · 2014年12月31日

基于混合Petri网的电力CPS协同建模与分析

国家自然科学基金

2+阅读 · 2013年12月31日

网络化多智能体系统预测控制设计与分析

国家自然科学基金

1+阅读 · 2012年12月31日

基于定理证明的多核并行程序验证

国家自然科学基金

0+阅读 · 2012年12月31日

基于不确定先验知识的支持向量机理论与算法研究

国家自然科学基金

1+阅读 · 2012年12月31日

多智能体系统的分布式采样一致性控制

国家自然科学基金

0+阅读 · 2012年12月31日

缆系式紧耦合多机器人系统协调建模及稳定性分析

国家自然科学基金

0+阅读 · 2012年12月31日

基于変分PDE的显著特征提取及其在图像检索中的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于本体的Deep Web搜索技术

国家自然科学基金

2+阅读 · 2009年12月31日

Sound-Guided Semantic Video Generation

Arxiv

0+阅读 · 2022年4月20日

Using a Semantic Knowledge Base to Improve the Management of Security Reports in Industrial DevOps Projects

Arxiv

0+阅读 · 2022年4月19日

Impact of Tokenization on Language Models: An Analysis for Turkish

Arxiv

0+阅读 · 2022年4月19日

Category-theoretical Semantics of the Description Logic ALC (extended version)

Arxiv

0+阅读 · 2022年4月18日

Semantic-based Data Augmentation for Math Word Problems

Arxiv

0+阅读 · 2022年4月18日

Nested Named Entity Recognition as Holistic Structure Parsing

Arxiv

0+阅读 · 2022年4月17日

Qtrade AI at SemEval-2022 Task 11: An Unified Framework for Multilingual NER Task

Arxiv

0+阅读 · 2022年4月14日

Linguistically-Informed Self-Attention for Semantic Role Labeling

Arxiv

17+阅读 · 2018年8月28日

Multilingual Sentiment Analysis: An RNN-Based Framework for Limited Data

Arxiv

12+阅读 · 2018年6月8日

Deep Semantic Role Labeling with Self-Attention

Arxiv

13+阅读 · 2017年12月5日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】多目标奖励与偏好优化：理论与算法

《无形的防御者？将定向能武器集成到反无人机框架的机遇与挑战》报告

自主化海军：海上无人系统与未来海战

迈向智能体系统规模化的科学

相关资讯

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

上百种预训练中文词向量：Chinese-Word-Vectors

上百种预训练中文词向量：Chinese-Word-Vectors

AINLP

23+阅读 · 2019年2月26日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

相关论文

Sound-Guided Semantic Video Generation

Arxiv

0+阅读 · 2022年4月20日

Using a Semantic Knowledge Base to Improve the Management of Security Reports in Industrial DevOps Projects

Arxiv

0+阅读 · 2022年4月19日

Impact of Tokenization on Language Models: An Analysis for Turkish

Arxiv

0+阅读 · 2022年4月19日

Category-theoretical Semantics of the Description Logic ALC (extended version)

Arxiv

0+阅读 · 2022年4月18日

Semantic-based Data Augmentation for Math Word Problems

Arxiv

0+阅读 · 2022年4月18日

Nested Named Entity Recognition as Holistic Structure Parsing

Arxiv

0+阅读 · 2022年4月17日

Qtrade AI at SemEval-2022 Task 11: An Unified Framework for Multilingual NER Task

Arxiv

0+阅读 · 2022年4月14日

Linguistically-Informed Self-Attention for Semantic Role Labeling

Arxiv

17+阅读 · 2018年8月28日

Multilingual Sentiment Analysis: An RNN-Based Framework for Limited Data

Arxiv

12+阅读 · 2018年6月8日

Deep Semantic Role Labeling with Self-Attention

Arxiv

13+阅读 · 2017年12月5日

相关基金

随机辛算法和多辛算法

国家自然科学基金

2+阅读 · 2014年12月31日

线性算子的谱结构及其扰动分析

国家自然科学基金

0+阅读 · 2014年12月31日

基于混合Petri网的电力CPS协同建模与分析

国家自然科学基金

2+阅读 · 2013年12月31日

网络化多智能体系统预测控制设计与分析

国家自然科学基金

1+阅读 · 2012年12月31日

基于定理证明的多核并行程序验证

国家自然科学基金

0+阅读 · 2012年12月31日

基于不确定先验知识的支持向量机理论与算法研究

国家自然科学基金

1+阅读 · 2012年12月31日

多智能体系统的分布式采样一致性控制

国家自然科学基金

0+阅读 · 2012年12月31日

缆系式紧耦合多机器人系统协调建模及稳定性分析

国家自然科学基金

0+阅读 · 2012年12月31日

基于変分PDE的显著特征提取及其在图像检索中的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于本体的Deep Web搜索技术

国家自然科学基金

2+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员