In this paper, we propose a planning framework to generate a defense strategy against an attacker who is working in an environment where a defender can operate without the attacker's knowledge. The objective of the defender is to covertly guide the attacker to a trap state from which the attacker cannot achieve their goal. Further, the defender is constrained to achieve its goal within K number of steps, where K is calculated as a pessimistic lower bound within which the attacker is unlikely to suspect a threat in the environment. Such a defense strategy is highly useful in real world systems like honeypots or honeynets, where an unsuspecting attacker interacts with a simulated production system while assuming it is the actual production system. Typically, the interaction between an attacker and a defender is captured using game theoretic frameworks. Our problem formulation allows us to capture it as a much simpler infinite horizon discounted MDP, in which the optimal policy for the MDP gives the defender's strategy against the actions of the attacker. Through empirical evaluation, we show the merits of our problem formulation.
翻译:在本文中,我们提出了一个计划框架,用于生成一种防御策略,针对的是一个攻击者在一个没有防御者知晓的环境中工作的形式。 防御者的目标是秘密地指导攻击者进入陷阱状态,从中攻击者无法实现其目标。此外,防御者受限于在K步内实现其目标,其中K被计算为一个悲观的下限,在该下限内,攻击者不太可能怀疑环境中存在威胁。这种防御策略在实际系统中非常有用,例如蜜罐或蜜网,其中一个毫无戒心的攻击者与模拟生产系统交互,同时假定它是实际生产系统。通常,攻击者和防御者之间的交互使用博弈论框架来捕捉。我们的问题定义允许将其捕捉为一个简单的无限时间折扣MDP,在其中,MDP的最优策略提供了敌手行动的防御者策略。通过实证评估,我们展示了我们的问题定义的优点。