字体 : 从失败中查找错误引出文件 (Fonte: Finding Bug Inducing Commits from Failures) - 专知论文

会员服务 ·

0

Bug · 可约的 · INFORMS · 秩 · 代价 ·

2023 年 2 月 15 日

Fonte: Finding Bug Inducing Commits from Failures

翻译：字体 : 从失败中查找错误引出文件

Gabin An,Jingun Hong,Naryeong Kim,Shin Yoo

from arxiv, 13 pages, 10 figures, 5 tables (to be published in ICSE'23)

A Bug Inducing Commit (BIC) is a commit that introduces a software bug into the codebase. Knowing the relevant BIC for a given bug can provide valuable information for debugging as well as bug triaging. However, existing BIC identification techniques are either too expensive (because they require the failing tests to be executed against previous versions for bisection) or inapplicable at the debugging time (because they require post hoc artefacts such as bug reports or bug fixes). We propose Fonte, an efficient and accurate BIC identification technique that only requires test coverage. Fonte combines Fault Localisation (FL) with BIC identification and ranks commits based on the suspiciousness of the code elements that they modified. Fonte reduces the search space of BICs using failure coverage as well as a filter that detects commits that are merely style changes. Our empirical evaluation using 130 real-world BICs shows that Fonte significantly outperforms state-of-the-art BIC identification techniques based on Information Retrieval as well as neural code embedding models, achieving at least 39% higher MRR. We also report that the ranking scores produced by Fonte can be used to perform weighted bisection, further reducing the cost of BIC identification. Finally, we apply Fonte to a large-scale industry project with over 10M lines of code, and show that it can rank the actual BIC within the top five commits for 87% of the studied real batch-testing failures, and save the BIC inspection cost by 32% on average.

翻译：错误导引器( BIC) 是一种在代码库中引入软件错误的承诺( BIC) 。了解给定错误的相关 BIC 可以提供宝贵的信息进行调试和错误处理。但是, 现有的 BIC 识别技术要么太昂贵( 因为它们要求对前版本的分解进行失败测试), 要么在调试时不适用( 因为需要错误报告或错误修正等临时人工制品) 。我们提议Fontee, 这是一种高效和准确的 BIC 识别技术, 只需要测试范围。 Fonte 将错误本地化( FL) 与 BIC 识别和级别结合起来, 可以基于他们修改的代码的代码的可疑性提供有价值的信息。 Fonte 减少了 BIC 的搜索空间, 以及一个过滤器, 因为它们需要对前一个版本进行测试, 因为它们需要用130个真实世界的 BIC 来进行测试。我们的经验评估Fonte 大大超出了基于信息Relieval 和神经代码嵌嵌入模型的状态, 达到至少39% MIC 。我们还报告BIC 的排序, 用于BIC 10 的BIC 的高级测试, 最后显示了BIC 10 的升级的升级的排序, 。

0

相关内容

Bug

程序猿的天敌有时是一个不能碰的magic

自然语言处理顶会NAACL2022最佳论文出炉！

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

ICLR2019最佳论文出炉

ICLR2019最佳论文出炉

专知

12+阅读 · 2019年5月6日

LibRec 精选：推荐系统的常用数据集

LibRec 精选：推荐系统的常用数据集

LibRec智能推荐

17+阅读 · 2019年2月15日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

LibRec 精选：推荐的可解释性[综述]

LibRec 精选：推荐的可解释性[综述]

LibRec智能推荐

10+阅读 · 2018年5月4日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

lncRNA HOTAIR与MRTF-A协同促进肿瘤细胞迁移和侵袭的分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

小系统中的反常扩散和Kramers逃逸问题研究

国家自然科学基金

0+阅读 · 2015年12月31日

无限闭凸集族凸可行性问题中投影算法的线性收敛

国家自然科学基金

0+阅读 · 2015年12月31日

Tfh细胞-IL21-B细胞轴在慢性乙型肝炎HBeAg血清学转换中的作用及补肾方的干预机制

国家自然科学基金

0+阅读 · 2014年12月31日

特定lincRNA在体细胞重编程中的功能与机制

国家自然科学基金

0+阅读 · 2012年12月31日

Wnt/β-catenin信号调控不同分化状态间充质干细胞定向迁移的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

DEC1、DEC2对人乳腺癌细胞衰老的调控作用及其作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

编码密码学中若干组合对象研究

国家自然科学基金

0+阅读 · 2009年12月31日

Data AUDIT: Identifying Attribute Utility- and Detectability-Induced Bias in Task Models

Data AUDIT: Identifying Attribute Utility- and Detectability-Induced Bias in Task Models

Arxiv

0+阅读 · 2023年4月6日

Smart Contract and DeFi Security: Insights from Tool Evaluations and Practitioner Surveys

Arxiv

0+阅读 · 2023年4月6日

Classifying Mental-Disorders through Clinicians Subjective Approach based on Three-way Decision

Arxiv

0+阅读 · 2023年4月5日

Explainable Automated Debugging via Large Language Model-driven Scientific Debugging

Arxiv

0+阅读 · 2023年4月5日

A Categorical Archive of ChatGPT Failures

Arxiv

0+阅读 · 2023年4月3日

RunBugRun -- An Executable Dataset for Automated Program Repair

Arxiv

0+阅读 · 2023年4月3日

Data-graph repairs: the preferred approach

Arxiv

0+阅读 · 2023年4月3日

The communication complexity of functions with large outputs

Arxiv

0+阅读 · 2023年4月1日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Neural Architecture Search without Training

Neural Architecture Search without Training

Arxiv

10+阅读 · 2021年6月11日

VIP会员

文章信息

相关主题

相关VIP内容

自然语言处理顶会NAACL2022最佳论文出炉！

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

ICLR2019最佳论文出炉

ICLR2019最佳论文出炉

专知

12+阅读 · 2019年5月6日

LibRec 精选：推荐系统的常用数据集

LibRec 精选：推荐系统的常用数据集

LibRec智能推荐

17+阅读 · 2019年2月15日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

LibRec 精选：推荐的可解释性[综述]

LibRec 精选：推荐的可解释性[综述]

LibRec智能推荐

10+阅读 · 2018年5月4日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Data AUDIT: Identifying Attribute Utility- and Detectability-Induced Bias in Task Models

Data AUDIT: Identifying Attribute Utility- and Detectability-Induced Bias in Task Models

Arxiv

0+阅读 · 2023年4月6日

Smart Contract and DeFi Security: Insights from Tool Evaluations and Practitioner Surveys

Arxiv

0+阅读 · 2023年4月6日

Classifying Mental-Disorders through Clinicians Subjective Approach based on Three-way Decision

Arxiv

0+阅读 · 2023年4月5日

Explainable Automated Debugging via Large Language Model-driven Scientific Debugging

Arxiv

0+阅读 · 2023年4月5日

A Categorical Archive of ChatGPT Failures

Arxiv

0+阅读 · 2023年4月3日

RunBugRun -- An Executable Dataset for Automated Program Repair

Arxiv

0+阅读 · 2023年4月3日

Data-graph repairs: the preferred approach

Arxiv

0+阅读 · 2023年4月3日

The communication complexity of functions with large outputs

Arxiv

0+阅读 · 2023年4月1日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Neural Architecture Search without Training

Neural Architecture Search without Training

Arxiv

10+阅读 · 2021年6月11日

相关基金

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

lncRNA HOTAIR与MRTF-A协同促进肿瘤细胞迁移和侵袭的分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

小系统中的反常扩散和Kramers逃逸问题研究

国家自然科学基金

0+阅读 · 2015年12月31日

无限闭凸集族凸可行性问题中投影算法的线性收敛

国家自然科学基金

0+阅读 · 2015年12月31日

Tfh细胞-IL21-B细胞轴在慢性乙型肝炎HBeAg血清学转换中的作用及补肾方的干预机制

国家自然科学基金

0+阅读 · 2014年12月31日

特定lincRNA在体细胞重编程中的功能与机制

国家自然科学基金

0+阅读 · 2012年12月31日

Wnt/β-catenin信号调控不同分化状态间充质干细胞定向迁移的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

DEC1、DEC2对人乳腺癌细胞衰老的调控作用及其作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

编码密码学中若干组合对象研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员