Stochastic control problems with delay are challenging due to the path-dependent feature of the system and thus its intrinsic high dimensions. In this paper, we propose and systematically study deep neural networks-based algorithms to solve stochastic control problems with delay features. Specifically, we employ neural networks for sequence modeling (\emph{e.g.}, recurrent neural networks such as long short-term memory) to parameterize the policy and optimize the objective function. The proposed algorithms are tested on three benchmark examples: a linear-quadratic problem, optimal consumption with fixed finite delay, and portfolio optimization with complete memory. Particularly, we notice that the architecture of recurrent neural networks naturally captures the path-dependent feature with much flexibility and yields better performance with more efficient and stable training of the network compared to feedforward networks. The superiority is even evident in the case of portfolio optimization with complete memory, which features infinite delay.
翻译:延迟的斯托克控制问题具有挑战性,因为该系统具有依赖路径的特点,因此具有内在的高度维度。在本文中,我们提出并系统地研究深神经网络的算法,以解决有延迟特性的随机控制问题。具体地说,我们使用神经网络进行序列建模(\emph{例如}),例如长期短期内存等经常性神经网络,以参数化政策,优化客观功能。提议的算法在三个基准示例上进行了测试:线性赤道问题,用固定的有限延迟进行最佳消费,以及用完整的记忆进行组合优化。特别是,我们注意到经常性神经网络的架构自然地以非常灵活的方式捕捉取依赖路径的特征,并且通过对网络进行更有效和更稳定的培训,从而产生更好的性能,而与前方网络相比,更高效和更稳定的网络培训。优势甚至表现在全记忆的组合优化中,其特点是无限延迟。