Studying the determinants of adverse pregnancy outcomes like stillbirth and preterm birth is of considerable interest in epidemiology. Understanding the role of both individual and community risk factors for these outcomes is crucial for planning appropriate clinical and public health interventions. With this goal, we develop geospatial mixed effects logistic regression models for adverse pregnancy outcomes. Our models account for both spatial autocorrelation and heterogeneity between neighborhoods. To mitigate the low incidence of stillbirth and preterm births in our data, we explore using class rebalancing techniques to improve predictive power. To assess the informative value of the covariates in our models, we use posterior distributions of their coefficients to gauge how well they can be distinguished from zero. As a case study, we model stillbirth and preterm birth in the city of Philadelphia, incorporating both patient-level data from electronic health records (EHR) data and publicly available neighborhood data at the census tract level. We find that patient-level features like self-identified race and ethnicity were highly informative for both outcomes. Neighborhood-level factors were also informative, with poverty important for stillbirth and crime important for preterm birth. Finally, we identify the neighborhoods in Philadelphia at highest risk of stillbirth and preterm birth.
翻译:研究死胎和早产等不利怀孕结果的决定因素对流行病学具有很大的兴趣。了解个人和社区风险因素对于这些结果的作用对于规划适当的临床和公共卫生干预措施至关重要。我们为此开发了有利于不利怀孕结果的地理间混合效应后勤回归模型。我们的模型既说明了社区之间的空间自动关系和异质性。为了减轻数据中死胎和早产的低发生率,我们探索了使用阶级再平衡技术来改善预测力。为了评估我们模型中的共变值的信息价值,我们使用其系数的后部分布来测量它们与零的区别。我们利用费城的死胎和早产模型进行案例研究,将电子健康记录(EHR)数据中的病人一级数据以及普查道一级的公众可得到的邻里数据都纳入其中。我们发现,病人一级的特征,如自我确定的种族和族裔,对两种结果都具有高度的知情性。内邻里比因素也具有信息性,对死前出生和出生前出生前的犯罪都很重要。最后,我们发现,在费城区发现,在分娩前的贫困风险最高。