Time-varying covariates are often available in survival studies and estimation of the hazard function needs to be updated as new information becomes available. In this paper, we investigate several different easy-to-implement ways that random forests can be used for dynamic estimation of the survival or hazard function from discrete-time survival data. The results from a simulation study indicate that all methods can perform well, and that none dominates the others. In general, situations that are more difficult from an estimation point of view (such as weaker signals and less data) favour a global fit, pooling over all time points, while situations that are easier from an estimation point of view (such as stronger signals and more data) favor local fits.
翻译:在生存研究和危险功能估计中,往往有时间变化的共变法,随着新的信息出现,需要更新。在本文件中,我们从离散的存活数据中调查了随机森林可用于动态估计生存或危险功能的几种不同的容易实施的方法。模拟研究的结果显示,所有方法都能很好地发挥作用,其他方法无一处于主导地位。一般而言,从估计观点看,比较困难的情况(如信号较弱和数据较少)更有利于全球,汇集到所有的时间点,而从估计观点看,比较容易估计的情况(如更强的信号和更多的数据)则适合当地情况。