The task of intercepting a target moving along a rectilinear or circular trajectory by a Dubins' car is formulated as a time-optimal control problem with an arbitrary direction of the car's velocity at the interception moment. To solve this problem and to synthesize interception trajectories, neural network methods of unsupervised learning based on the Deep Deterministic Policy Gradient algorithm are used. The analysis of the obtained control laws and interception trajectories in comparison with the analytical solutions of the interception problem is performed. The mathematical modeling for the parameters of the target movement that the neural network had not seen before during training is carried out. Model experiments are conducted to test the stability of the neural solution. The effectiveness of using neural network methods for the synthesis of interception trajectories for given classes of target movements is shown.
翻译:将沿着直线或圆形轨迹移动的目标用Dubins小车拦截的任务被看作是一个具有拦截时任意方向车速的最优控制问题。为了解决这个问题并合成拦截轨迹,采用了基于深度确定性策略梯度算法的无监督学习的神经网络方法。对所得到的控制规律和拦截轨迹进行了与拦截问题的解析解比较的分析。对神经网络之前未见过的目标运动参数进行了数学建模。进行了模型实验来测试神经解决方案的稳定性。论文证明了使用神经网络方法合成给定目标运动类别的拦截轨迹的有效性。