A deep reinforcement learning technique is presented for task offloading decision-making algorithms for a multi-access edge computing (MEC) assisted unmanned aerial vehicle (UAV) network in a smart farm Internet of Things (IoT) environment. The task offloading technique uses financial concepts such as cost functions and conditional variable at risk (CVaR) in order to quantify the damage that may be caused by each risky action. The approach was able to quantify potential risks to train the reinforcement learning agent to avoid risky behaviors that will lead to irreversible consequences for the farm. Such consequences include an undetected fire, pest infestation, or a UAV being unusable. The proposed CVaR-based technique was compared to other deep reinforcement learning techniques and two fixed rule-based techniques. The simulation results show that the CVaR-based risk quantifying method eliminated the most dangerous risk, which was exceeding the deadline for a fire detection task. As a result, it reduced the total number of deadline violations with a negligible increase in energy consumption.
翻译:深度强化学习技术用于在智能农场Things(IoT)互联网环境中对多接入边缘计算辅助无人驾驶飞行器(无人驾驶飞行器)网络进行任务卸载决策算法任务,该任务卸载技术使用成本功能和条件风险变量等金融概念,以量化每项风险行动可能造成的损害;该方法能够量化潜在风险,培训强化学习代理以避免危险行为对农场造成不可逆转的后果,这些后果包括未发现火灾、虫害或无人驾驶飞行器无法使用。拟议的CVaR技术与其他深度强化学习技术和两项固定规则技术进行了比较。模拟结果表明,基于CVaR的风险量化方法消除了最危险的风险,超过了火灾探测任务的最后期限。因此,它减少了最后期限违规的总数,能源消耗量略有增加。