This chapter is dedicated to the assessment and performance estimation of machine learning (ML) algorithms, a topic that is equally important to the construction of these algorithms, in particular in the context of cyberphysical security design. The literature is full of nonparametric methods to estimate a statistic from just one available dataset through resampling techniques, e.g., jackknife, bootstrap and cross validation (CV). Special statistics of great interest are the error rate and the area under the ROC curve (AUC) of a classification rule. The importance of these resampling methods stems from the fact that they require no knowledge about the probability distribution of the data or the construction details of the ML algorithm. This chapter provides a concise review of this literature to establish a coherent theoretical framework for these methods that can estimate both the error rate (a one-sample statistic) and the AUC (a two-sample statistic). The resampling methods are usually computationally expensive, because they rely on repeating the training and testing of a ML algorithm after each resampling iteration. Therefore, the practical applicability of some of these methods may be limited to the traditional ML algorithms rather than the very computationally demanding approaches of the recent deep neural networks (DNN). In the field of cyberphysical security, many applications generate structured (tabular) data, which can be fed to all traditional ML approaches. This is in contrast to the DNN approaches, which favor unstructured data, e.g., images, text, voice, etc.; hence, the relevance of this chapter to this field.%
翻译:本章专门论述机器学习算法(ML)的评估和性能估计,这是一个对构建这些算法同样重要的主题,特别是在网络物理安全设计方面。文献中充满了非对称方法,通过重新抽样技术,例如,杰克奈夫、靴套和交叉校验(CV),从一个可用的数据集中估算统计数据。非常有意义的特殊统计数据是一个分类规则的ROC曲线(AUC)下的错误率和区域。这些重现方法的重要性源于这样一个事实,即它们不需要了解数据或ML算法的构建细节的概率分布。这一章节简要地审视了这一文献,以建立一个连贯的理论框架,用以估算出错误率(一号统计、靴套和交叉校验(C.V.)和AUC(二号统计)。 重现方法通常具有计算成本,因为每次重现后都重复对ML算法的培训和测试。因此,这些方法的不精准性数据或构建细节细节,这些方法的实际适用性在ML的网络中,这种结构域中,这种结构域的系统应用性方法可能限制于传统的ML的深度计算方法,因此,而ML 的 RAL 的模型的实地方法可进行。