Logs are widely used to record runtime information of software systems, such as the timestamp and the importance of an event, the unique ID of the source of the log, and a part of the state of a task's execution. The rich information of logs enables system developers (and operators) to monitor the runtime behaviors of their systems and further track down system problems and perform analysis on log data in production settings. However, the prior research on utilizing logs is scattered and that limits the ability of new researchers in this field to quickly get to the speed and hampers currently active researchers to advance this field further. Therefore, this paper surveys and provides a systematic literature review of the contemporary logging practices and log statements' mining and monitoring techniques and their applications such as in system failure detection and diagnosis. We study a large number of conference and journal papers that appeared on top-level peer-reviewed venues. Additionally, we draw high-level trends of ongoing research and categorize publications into subdivisions. In the end, and based on our holistic observations during this survey, we provide a set of challenges and opportunities that will lead the researchers in academia and industry in moving the field forward.
翻译:日志的丰富信息使系统开发者(和操作者)能够监测其系统的运行时间行为,进一步跟踪系统问题,并分析生产环境中的日志数据。然而,先前关于使用日志的研究分散,限制了该领域新研究人员迅速达到速度的能力,妨碍了目前活跃的研究人员进一步推进这一领域的工作。因此,本文对当代伐木做法和日志的开采和监测技术及其应用,如系统故障检测和诊断,进行了系统化文献审查,并提供了系统化文献审查。我们研究了在最高级别同行审查地点出现的大量会议和日记文件。此外,我们将正在进行的研究和出版物分类到子类的高度趋势。最后,根据我们在这次调查中的整体观察,我们提供了一组挑战和机遇,引导学术界和工业界的研究人员向前推进。