Object-oriented data analysis is a fascinating and developing field in modern statistical science with the potential to make significant and valuable contributions to biomedical applications. This statistical framework allows for the formalization of new methods to analyze complex data objects that capture more information than traditional clinical biomarkers. The paper applies the object-oriented framework to analyzing and predicting physical activity measured by accelerometers. As opposed to traditional summary metrics, we utilize a recently proposed representation of physical activity data as a distributional object, providing a more sophisticated and complete profile of individual energetic expenditure in all ranges of monitoring intensity. For the purpose of predicting these distributional objects, we propose a novel hybrid Frechet regression model and apply it to US population accelerometer data from NHANES 2011-2014. The semi-parametric character of the new model allows us to introduce non-linear effects for essential variables, such as age, that are known from a biological point of view to have nuanced effects on physical activity. At the same time, the inclusion of a global for linear term retains the advantage of interpretability for other variables, particularly categorical covariates such as ethnicity and sex. The results obtained in our analysis are helpful from a public health perspective and may lead to new strategies for optimizing physical activity interventions in specific American subpopulations.
翻译:以物体为导向的数据分析是现代统计科学中一个令人着迷和不断发展的领域,有可能为生物医学应用做出重要和有价值的贡献。这一统计框架使分析收集比传统的临床生物标志更多的信息的复杂数据对象的新方法正规化,该文件应用了以物体为导向的框架来分析和预测以加速度计测量的物理活动。与传统的简要指标相反,我们利用最近提出的将体育活动数据作为分布性物体的表述方式,在所有监测强度范围内提供个人精力充沛支出的更精密和完整的概况。为预测这些分布性物体的目的,我们提出一个新的混合Frechet回归模型,并将其适用于美国人口加速度计数据,从2011-2014年NHANES到2014年。新模型的半参数性质使我们能够对基本变量(例如年龄)产生非线性影响,从生物角度认识这些数据对物理活动有细微的影响。同时,在线性术语中纳入个人精力支出的全称保留了其他变量的解释性优势,特别是直线性可变数,如种族和性别等。在优化的物理活动中,从公共卫生战略中获得的有益结果。