Multistate Markov models are a canonical parametric approach for data modeling of observed or latent stochastic processes supported on a finite state space. Continuous-time Markov processes describe data that are observed irregularly over time, as is often the case in longitudinal medical and biological data sets, for example. Assuming that a continuous-time Markov process is time-homogeneous, a closed-form likelihood function can be derived from the Kolmogorov forward equations -- a system of differential equations with a well-known matrix-exponential solution. Unfortunately, however, the forward equations do not admit an analytical solution for continuous-time, time-inhomogeneous Markov processes, and so researchers and practitioners often make the simplifying assumption that the process is piecewise time-homogeneous. In this paper, we provide intuitions and illustrations of the potential biases for parameter estimation that may ensue in the more realistic scenario that the piecewise-homogeneous assumption is violated, and we advocate for a solution for likelihood computation in a truly time-inhomogeneous fashion. Particular focus is afforded to the context of multistate Markov models that allow for state label misclassifications, which applies more broadly to hidden Markov models (HMMs), and Bayesian computations bypass the necessity for computationally demanding numerical gradient approximations for obtaining maximum likelihood estimates (MLEs).
翻译:多 State Markov 模型是用于在有限空间支持的观测或潜在随机过程的数据建模的典型比方。 持续时间的Markov 进程描述的是长期不定期观测的数据,例如纵向医疗和生物数据集中经常出现的情况。 假设连续时间的Markov 进程是时间和多变性的,从科尔莫戈罗夫前方方方程中可以得出封闭形式的可能性功能 -- -- 一种差异方程系统,有广为人知的矩阵扩展解决方案。 但不幸的是,远方方程并不接受对连续时间、时间和不相容的Markov 进程的分析解决方案,因此研究人员和从业人员往往作出简化的假设,即该过程是零碎时间和多变性的。 在本文中,我们提供了对参数估计的潜在偏向直的直观和说明,在更现实的假设中,小相偏向的假设被违反,我们主张在真正时间和多极化的基度模型中进行可能的计算。 具体的重点是让IMMI的计算成为了最隐性成本的基值模型, 。