A myriad of recent literary works has leveraged generative adversarial networks (GANs) to spawn unseen evasion samples. The purpose is to annex the generated data with the original train set for adversarial training to improve the detection performance of machine learning (ML) classifiers. The quality of generating adversarial samples relies on the adequacy of training data samples. However, in low data regimes like medical diagnostic imaging and cybersecurity, the anomaly samples are scarce in number. This paper proposes a novel GAN design called Evasion Generative Adversarial Network (EVAGAN) that is more suitable for low data regime problems that use oversampling for detection improvement of ML classifiers. EVAGAN not only can generate evasion samples, but its discriminator can act as an evasion aware classifier. We have considered Auxiliary Classifier GAN (ACGAN) as a benchmark to evaluate the performance of EVAGAN on cybersecurity (ISCX-2014, CIC-2017 and CIC2018) botnet and computer vision (MNIST) datasets. We demonstrate that EVAGAN outperforms ACGAN for unbalanced datasets with respect to detection performance, training stability and time complexity. EVAGAN's generator quickly learns to generate the low sample class and hardens its discriminator simultaneously. In contrast to ML classifiers that require security hardening after being adversarially trained by GAN generated data, EVAGAN renders it needless. The experimental analysis proves that EVAGAN is an efficient evasion hardened model for low data regimes for the selected cybersecurity and computer vision datasets. Code will be available at https://github.com/rhr407/EVAGAN.
翻译:最近大量文学作品利用了基因对抗网络(GANs)来生成逃避的样本。目的是将生成的数据与最初的对抗性训练列列列列的原始列车合在一起,以提高机器学习(ML)分类人员的检测性能。生成对抗性抽样的质量取决于培训数据样本是否充足。然而,在医疗诊断成像和网络安全等低数据系统中,异常抽样数量很少。本文件建议采用一种新型的GAN设计,称为Evasion General Adversarial 网络(EVAGAN),它更适合低数据系统问题,而低数据系统使用过度抽样来改进ML分类者的检测性能。EVAGAN不仅可以生成逃避性样,而且其导师也可以作为规避性能分析者。我们认为ARCAAN(ACANAN)作为评估EVAN(IS-2014、CIC-2017和CIC2018模型)网络和计算机观察系统(MNIST)的性能。我们表明,EVAAN比对不平衡的AGGANAN(精细化数据分析)的精确度分析需要快速的系统。AVAGAG的稳定性和经训练后, 将使得GAGARC级数据变压的精细化数据变的精制成。