The embedding and extraction of useful knowledge is a recent trend in machine learning applications, e.g., to supplement existing datasets that are small. Whilst, as the increasing use of machine learning models in security-critical applications, the embedding and extraction of malicious knowledge are equivalent to the notorious backdoor attack and its defence, respectively. This paper studies the embedding and extraction of knowledge in tree ensemble classifiers, and focuses on knowledge expressible with a generic form of Boolean formulas, e.g., robustness properties and backdoor attacks. For the embedding, it is required to be preservative(the original performance of the classifier is preserved), verifiable(the knowledge can be attested), and stealthy(the embedding cannot be easily detected). To facilitate this, we propose two novel, and effective, embedding algorithms, one of which is for black-box settings and the other for white-box settings.The embedding can be done in PTIME. Beyond the embedding, we develop an algorithm to extract the embedded knowledge, by reducing the problem to be solvable with an SMT (satisfiability modulo theories) solver. While this novel algorithm can successfully extract knowledge, the reduction leads to an NP computation. Therefore, if applying embedding as backdoor attacks and extraction as defence, our results suggest a complexity gap (P vs. NP) between the attack and defence when working with tree ensemble classifiers. We apply our algorithms toa diverse set of datasets to validate our conclusion extensively.
翻译:有用知识的嵌入和提取是机器学习应用中的一种最新趋势,例如,用来补充现有的小数据集。虽然由于在安全关键应用中越来越多地使用机器学习模型,恶意知识的嵌入和提取分别相当于臭名昭著的后门攻击及其防御。本文研究了植入和提取树群群分类器中的知识,并侧重于以布林公式通用形式表达的知识,例如,稳健性能和后门攻击。对于嵌入,必须具有防腐蚀性(分类的原始性能得到保存)、可核实性(知识得到验证)和隐性(嵌入知识不能轻易被检测)。为了促进这一点,我们建议两种新颖和有效的嵌入算法,其中一种用于黑箱设置,另一种用于白箱设置。嵌入可以在PTIME中进行。除了嵌入外,我们还开发一种算法来提取嵌入式知识,通过将问题降低为可溶解的问题,而将SMT的精度的精度应用(可验证)和隐化性(嵌入的算法,如果将SMT, IM 将我们的数据缩的精度的精度排序应用新的算法,则可以将SMLIBLM的解后算结果应用为我们的精化后。