深层状态推断:走向黑箱软件系统行为模型推断 (Deep State Inference: Toward Behavioral Model Inference of Black-box Software Systems)

Many software engineering tasks, such as testing, and anomaly detection can benefit from the ability to infer a behavioral model of the software.Most existing inference approaches assume access to code to collect execution sequences. In this paper, we investigate a black-box scenario, where the system under analysis cannot be instrumented, in this granular fashion.This scenario is particularly prevalent with control systems' log analysis in the form of continuous signals. In this situation, an execution trace amounts to a multivariate time-series of input and output signals, where different states of the system correspond to different `phases` in the time-series. The main challenge is to detect when these phase changes take place. Unfortunately, most existing solutions are either univariate, make assumptions on the data distribution, or have limited learning power.Therefore, we propose a hybrid deep neural network that accepts as input a multivariate time series and applies a set of convolutional and recurrent layers to learn the non-linear correlations between signals and the patterns over time.We show how this approach can be used to accurately detect state changes, and how the inferred models can be successfully applied to transfer-learning scenarios, to accurately process traces from different products with similar execution characteristics. Our experimental results on two UAV autopilot case studies indicate that our approach is highly accurate (over 90% F1 score for state classification) and significantly improves baselines (by up to 102% for change point detection).Using transfer learning we also show that up to 90% of the maximum achievable F1 scores in the open-source case study can be achieved by reusing the trained models from the industrial case and only fine tuning them using as low as 5 labeled samples, which reduces the manual labeling effort by 98%.

翻译：许多软件工程任务,例如测试和异常检测,都可以从推断软件行为模型的能力中受益。多数现有推断方法假定使用代码来收集执行序列。在本文件中, 我们调查了一个黑箱情景, 正在分析的系统无法以这种颗粒方式使用仪器。这个情景在控制系统日志分析中以连续信号的形式特别普遍。在这种情况下, 执行跟踪相当于输入和输出信号的多变时间序列, 该系统的不同状态对应时间序列中的不同“ 阶段 ” 。主要的挑战是如何在这些阶段变化发生时发现可实现源代码。不幸的是, 多数现有解决方案要么是非ivariate, 假设数据分布, 要么是有限的学习能力。因此, 我们提出一个混合的深神经网络, 接受一个多变时间序列, 并应用一组变动和反复的层来学习信号和产出信号之间的非线性关联。我们展示了这个方法可以用来精确地检测时间序列中的“ 阶段 ” 。用于精确度变化, 并且用推算模型可以成功地应用到快速的F级模型。