With the advent of continuous health monitoring via wearable devices, users now generate their unique streams of continuous data such as minute-level physical activity or heart rate. Aggregating these streams into scalar summaries ignores the distributional nature of data and often leads to the loss of critical information. We propose to capture the distributional properties of wearable data via user-specific quantile functions that are further used in functional regression and multi-modal distributional modelling. In addition, we propose to encode user-specific distributional information with user-specific L-moments, robust rank-based analogs of traditional moments. Importantly, this L-moment encoding results in mutually consistent functional and distributional interpretation of the results of scalar-on-function regression. We also demonstrate how L-moments can be flexibly employed for analyzing joint and individual sources of variation in multi-modal distributional data. The proposed methods are illustrated in a study of association of accelerometry-derived digital gait biomarkers with Alzheimer's disease (AD) and in people with normal cognitive function. Our analysis shows that the proposed quantile-based representation results in a much higher predictive performance compared to simple distributional summaries and attains much stronger associations with clinical cognitive scales.
翻译:随着通过可磨损装置不断进行健康监测的出现,用户现在产生其独特的连续数据流,如微小的物理活动或心率等。将这些流汇总成缩略图,忽略了数据的分布性质,往往导致重要信息丢失。我们提议通过功能回归和多模式分布模型中进一步使用的用户特有的四分位函数,捕捉可磨损数据的分布特性。此外,我们提议用用户特有的时间点,强力的按等级排列的传统时刻模拟,将特定用户的分布信息编码为编码。重要的是,这种L-移动编码的结果是,对降幅-功能回归的结果进行相互一致的功能和分布解释。我们还表明,如何灵活使用L-moment来分析多模式分布数据差异的联合和个别来源。在一项研究中说明了拟议的方法,该研究将基于偏差的计量数字座标与阿尔茨海默氏病(AD)和具有正常认知功能的人联系起来。我们的分析显示,拟议的以缩略图为基础的缩图显示,与较强的临床缩略图相比,在较强的临床缩缩度上取得了较强的预测性表现。