Rapid developments in streaming data technologies are continuing to generate increased interest in monitoring human activity. Wearable devices, such as wrist-worn sensors that monitor gross motor activity (actigraphy), have become prevalent. An Actigraph unit continually records the activity level of an individual, producing a very large amount of data at a high-resolution that can be immediately downloaded and analyzed. While this kind of big data includes both spatial and temporal information, the variation in such data seems to be more appropriately modeled by considering stochastic evolution through time while accounting for spatial information separately. We propose a comprehensive Bayesian hierarchical modeling and inferential framework for actigraphy data reckoning with the massive sizes of such databases while attempting to offer full inference. Building upon recent developments in this field, we construct Nearest Neighbour Gaussian Processes(NNGPs) for actigraphy data to compute at large temporal scales. More specifically, we construct a temporal NNGP and we focus on the optimized implementation of the collapsed algorithm in this specific context. This approach permits improved model scaling while also offering full inference. We test and validate our methods on simulated data and subsequently apply and verify their predictive ability on an original data-set concerning a health study conducted by the Fielding School of Public Health of the University of California Los Angeles.
翻译:数据流技术的迅速发展正在继续促使人们更加关注监测人类活动。穿戴装置,例如手腕式传感器,监测运动总活动(活动法),已经变得很普遍。一个活性单位不断记录一个人的活动水平,以高分辨率制作大量数据,可以立即下载和分析这些数据。虽然这种大数据包括空间和时间信息,但这些数据的变异似乎更适宜地建模,方法是考虑时间的随机演化,同时单独核算空间信息。我们提议采用一个全面的巴耶斯分层建模和推断框架,根据这种数据库的庞大规模来计算活动数据。我们根据该领域的最新发展,建立近邻高斯进程(NNGPs),用于在大型时间尺度上进行校准数据。更具体地说,我们建立一个时间性NGP,我们注重在这种特定情况下优化地采用崩溃计算法。这个方法可以改进模型的缩放,同时提供充分的推断力。我们根据该领域的最新发展情况,测试和验证了我们大学的实地健康研究的原始数据。我们随后通过模拟和实地预测,对大学的健康状况进行了实地数据进行了一项预测。