Each machine learning model deployed into production has a risk of adversarial attack. Quantifying the contributing factors and uncertainties using empirical measures could assist the industry with assessing the risk of downloading and deploying common machine learning model types. The Drake Equation is famously used for parameterizing uncertainties and estimating the number of radio-capable extra-terrestrial civilizations. This work proposes modifying the traditional Drake Equation's formalism to estimate the number of potentially successful adversarial attacks on a deployed model. While previous work has outlined methods for discovering vulnerabilities in public model architectures, the proposed equation seeks to provide a semi-quantitative benchmark for evaluating the potential risk factors of adversarial attacks.
翻译:利用经验性措施对促成因素和不确定因素进行量化,可有助于该行业评估下载和部署通用机器学习模型类型的风险; Draake 等量法被著名地用于对不确定性进行参数化和估计无线电能力外天体文明的数量; 这项工作提议修改传统的Drake Equation形式主义,以估计对已部署模型进行的潜在成功对抗性攻击的数量; 先前的工作概述了在公共模型结构中发现脆弱性的方法,而拟议的等式则试图为评价对抗性攻击的潜在风险因素提供一个半定量基准。