Mobile technology enables unprecedented continuous monitoring of an individual's behavior, social interactions, symptoms, and other health conditions, presenting an enormous opportunity for therapeutic advancements and scientific discoveries regarding the etiology of psychiatric illness. Continuous collection of mobile data results in the generation of a new type of data: entangled multivariate time series of outcome, exposure, and covariates. Missing data is a pervasive problem in biomedical and social science research, and the Ecological Momentary Assessment (EMA) using mobile devices in psychiatric research is no exception. However, the complex structure of multivariate time series introduces new challenges in handling missing data for proper causal inference. Data imputation is commonly recommended to enhance data utility and estimation efficiency. The majority of available imputation methods are either designed for longitudinal data with limited follow-up times or for stationary time series, which are incompatible with potentially non-stationary time series. In the field of psychiatry, non-stationary data are frequently encountered as symptoms and treatment regimens may experience dramatic changes over time. To address missing data in possibly non-stationary multivariate time series, we propose a novel multiple imputation strategy based on the state space model (SSMmp) and a more computationally efficient variant (SSMimpute). We demonstrate their advantages over other widely used missing data strategies by evaluating their theoretical properties and empirical performance in simulations of both stationary and non-stationary time series, subject to various missing mechanisms. We apply the SSMimpute to investigate the association between social network size and negative mood using a multi-year observational smartphone study of bipolar patients, controlling for confounding variables.
翻译:移动技术使得能够对个人的行为、社会互动、症状和其他健康状况进行史无前例的持续监测,从而为治疗进步和精神病病理学科学发现提供了巨大的机会。不断收集移动数据的结果可以产生一种新的数据类型:结果、接触和共变的多变时间序列。在生物医学和社会科学研究中,缺少数据是一个普遍的问题,在精神病研究中使用移动设备进行生态动力评估(EMA ) 也不例外。然而,多变时间序列的复杂结构为处理缺失数据以进行适当的因果推断带来了新的挑战。数据估算通常建议提高数据效用和估计效率。现有的估算方法大多是设计用于具有有限后续时间或固定时间序列的纵向数据。在精神病学领域,由于症状和治疗疗程可能随时间变化而发生急剧变化,多变时间序列的复杂结构为处理缺失数据提供了新的挑战。在可能的非固定多变数时间序列中,数据估算通常建议提高数据的效用和估计效率。我们建议,在智能时间序列中采用新的多变化战略,通过系统进行新的多变换的系统,通过系统进行新的空间变现性分析,以显示其演化的系统。