Discrete-choice life cycle models of labor supply can be used to estimate how social security reforms influence employment rate. In a life cycle model, optimal employment choices during the life course of an individual must be solved. Mostly, life cycle models have been solved with dynamic programming, which is not feasible when the state space is large, as often is the case in a realistic life cycle model. Solving a complex life cycle model requires the use of approximate methods, such as reinforced learning algorithms. We compare how well a deep reinforced learning algorithm ACKTR and dynamic programming solve a relatively simple life cycle model. To analyze results, we use a selection of statistics and also compare the resulting optimal employment choices at various states. The statistics demonstrate that ACKTR yields almost as good results as dynamic programming. Qualitatively, dynamic programming yields more spiked aggregate employment profiles than ACKTR. The results obtained with ACKTR provide a good, yet not perfect, approximation to the results of dynamic programming. In addition to the baseline case, we analyze two social security reforms: (1) an increase of retirement age, and (2) universal basic income. Our results suggest that reinforced learning algorithms can be of significant value in developing social security reforms.
翻译:劳动供应的复杂生命周期模型可以用来估计社会保障改革如何影响就业率。在生命周期模型中,必须解决个人一生中的最佳就业选择。多数情况下,生命周期模型是通过动态编程解决的,当国家空间很大时,这是行不通的,在现实生命周期模型中往往是这样。解决复杂的生命周期模型需要使用近似方法,例如强化学习算法。我们比较了深度强化的学习算法ACKTR和动态编程如何能解决相对简单的生命周期模型。为了分析结果,我们选择了一些统计数据,并比较了各个州的最佳就业选择。统计数据表明,ACKTR几乎像动态编程那样产生良好的结果。从本质上讲,动态编程能够产生比ACKTR更多的快速综合就业概况。用ACKTR获得的结果为动态编程的结果提供了良好但并非完美的近似。除了基线案例外,我们分析了两种社会保障改革:(1)退休年龄的增长,和(2)普遍基本收入的增长。我们的成果表明,强化的学习算法可以极大地发展社会安全价值。