This work considers online optimal motion planning of an autonomous agent subject to linear temporal logic (LTL) constraints. The environment is dynamic in the sense of containing mobile obstacles and time-varying areas of interest (i.e., time-varying reward and workspace properties) to be visited by the agent. Since user-specified tasks may not be fully realized (i.e., partially infeasible), this work considers hard and soft LTL constraints, where hard constraints enforce safety requirement (e.g. avoid obstacles) while soft constraints represent tasks that can be relaxed to not strictly follow user specifications. The motion planning of the agent is to generate policies, in decreasing order of priority, to 1) formally guarantee the satisfaction of safety constraints; 2) mostly satisfy soft constraints (i.e., minimize the violation cost if desired tasks are partially infeasible); and 3) optimize the objective of rewards collection (i.e., visiting dynamic areas of more interests). To achieve these objectives, a relaxed product automaton, which allows the agent to not strictly follow the desired LTL constraints, is constructed. A utility function is developed to quantify the differences between the revised and the desired motion plan, and the accumulated rewards are designed to bias the motion plan towards those areas of more interests. Receding horizon control is synthesized with an LTL formula to maximize the accumulated utilities over a finite horizon, while ensuring that safety constraints are fully satisfied and soft constraints are mostly satisfied. Simulation and experiment results are provided to demonstrate the effectiveness of the developed motion strategy.
翻译:这项工作考虑的是受线性时间逻辑(LTL)制约的自主代理商在线最佳运动规划;环境是动态的,其含义是控制移动障碍和代理商将访问的时间变化兴趣领域(即时间变化奖励和工作空间属性),由代理商访问。由于用户指定的任务可能无法完全完成(即部分不可行),这项工作考虑的是软硬的LTL限制,其中硬性限制强制执行安全要求(例如避免障碍),而软性限制代表了不严格遵循用户要求的任务。该代理商的行动规划是制定政策,按优先次序减少,以便1 正式保证安全限制的满意度;2 大部分满足软性限制(即,如果预期任务部分不可行,则尽量减少违规费用); 和 3) 优化奖励收集的目标(即访问更多兴趣的动态领域),为了实现这些目标,将放松产品自动图解,使代理商不严格遵循LTL的限制。 正在构建一种实用功能,主要用来量化所期望的利率差异,同时确定所设计的软性灵活性,同时确定所设计的弹性战略的稳定性是符合L的累进程度。