In this paper, we present a novel system for fault injection in production for Java applications. The unique feature of this system is to combine automated monitoring, automated perturbation injection, and automated resilience improvement. The latter is achieved with ideas coming from the failure-oblivious literature. We design and implement the system as agents for the Java virtual machine. We evaluate the system on a real-world application for transferring files with the BitTorrent protocol. Our results shows that it is possible to automatically improve the resilience of Java applications with respect to uncaught exceptions.
翻译:在本文中,我们为爪哇应用软件的制作提供了一个新的错误注入系统,该系统的独有特点是将自动监测、自动扰动注射和自动复原力改进结合起来,后者是通过从显出故障的文献中产生的想法来实现的。我们设计和实施这个系统,作为爪哇虚拟机器的代理。我们用BitTorrent协议对用于传输文件的实时应用系统进行评估。我们的结果表明,可以自动提高爪哇应用软件在未发现例外方面的复原力。