The analysis of population-wide datasets can provide insight on the health status of large populations so that public health officials can make data-driven decisions. The analysis of such datasets often requires highly parameterized models with different types of fixed and randoms effects to account for risk factors, spatial and temporal variations, multilevel effects and other sources on uncertainty. To illustrate the potential of Bayesian hierarchical models, a dataset of about 500 000 inhabitants released by the Polish National Health Fund containing information about ischemic stroke incidence for a 2-year period is analyzed using different types of models. Spatial logistic regression and survival models are considered for analyzing the individual probabilities of stroke and the times to the occurrence of an ischemic stroke event. Demographic and socioeconomic variables as well as drug prescription information are available at an individual level. Spatial variation is considered by means of region-level random effects.
翻译:对全人口数据集的分析可以使人们深入了解大量人口的健康状况,以便公共卫生官员能够根据数据作出决策。分析这类数据集往往需要具有不同类型固定和随机影响的高度参数化模型,以考虑到风险因素、空间和时间变化、多层次影响和不确定性的其他来源。为说明巴耶斯等级模型的潜力,波兰国家卫生基金利用不同类型的模型对包含两年内失血中风发生率信息的大约500 000名居民数据集进行了分析。空间后勤回归和生存模型被考虑用于分析中风的个别概率和发生失热事件的时间。人口和社会经济变量以及药物处方信息在个人层面上都有。通过区域随机影响来考虑空间变化。