Modern software development and operations rely on monitoring to understand how systems behave in production. The data provided by application logs and runtime environment are essential to detect and diagnose undesired behavior and improve system reliability. However, despite the rich ecosystem around industry-ready log solutions, monitoring complex systems and getting insights from log data remains a challenge. Researchers and practitioners have been actively working to address several challenges related to logs, e.g., how to effectively provide better tooling support for logging decisions to developers, how to effectively process and store log data, and how to extract insights from log data. A holistic view of the research effort on logging practices and automated log analysis is key to provide directions and disseminate the state-of-the-art for technology transfer. In this paper, we study 108 papers (72 research track papers, 24 journals, and 12 industry track papers) from different communities (e.g., machine learning, software engineering, and systems) and structure the research field in light of the life-cycle of log data. Our analysis shows that (1) logging is challenging not only in open-source projects but also in industry, (2) machine learning is a promising approach to enable a contextual analysis of source code for log recommendation but further investigation is required to assess the usability of those tools in practice, (3) few studies approached efficient persistence of log data, and (4) there are open opportunities to analyze application logs and to evaluate state-of-the-art log analysis techniques in a DevOps context.
翻译:应用日志和运行环境提供的数据对于检测和诊断不理想行为和提高系统可靠性至关重要。然而,尽管行业成熟的日志解决方案周围生态系统丰富,但监测复杂系统和从日志数据中获得洞察力仍是一个挑战。研究人员和从业人员一直在积极努力应对与日志有关的若干挑战,例如,如何有效地为开发者伐木决定提供更好的工具支持,如何有效地处理和储存日志数据,以及如何从日志数据中提取洞察力。从记录数据中获取洞察力。全面审视伐木做法和自动日志分析是提供指导和传播技术转让最新技术的关键。在本文件中,我们研究了来自不同社区的108份文件(72份研究轨道文件、24份期刊和12份行业轨道文件),(例如,机器学习、软件工程和系统),并根据日志数据生命周期数据数据数据生命周期构建研究领域。我们的分析表明:(1) 伐木不仅在公开源项目中,而且在工业中也是挑战性。(2) 机床学习是一种很有希望的做法,以便从背景源代码中进行持续分析的机会。