Importance: Social determinants of health (SDOH) are known to be associated with increased risk of suicidal behaviors, but few studies utilized SDOH from unstructured electronic health record (EHR) notes. Objective: To investigate associations between suicide and recent SDOH, identified using structured and unstructured data. Design: Nested case-control study. Setting: EHR data from the US Veterans Health Administration (VHA). Participants: 6,122,785 Veterans who received care in the US VHA between October 1, 2010, and September 30, 2015. Exposures: Occurrence of SDOH over a maximum span of two years compared with no occurrence of SDOH. Main Outcomes and Measures: Cases of suicide deaths were matched with 4 controls on birth year, cohort entry date, sex, and duration of follow-up. We developed an NLP system to extract SDOH from unstructured notes. Structured data, NLP on unstructured data, and combining them yielded seven, eight and nine SDOH respectively. Adjusted odds ratios (aORs) and 95% confidence intervals (CIs) were estimated using conditional logistic regression. Results: In our cohort, 8,821 Veterans committed suicide during 23,725,382 person-years of follow-up (incidence rate 37.18 /100,000 person-years). Our cohort was mostly male (92.23%) and white (76.99%). Across the six common SDOH as covariates, NLP-extracted SDOH, on average, covered 84.38% of all SDOH occurrences. All SDOH, measured by structured data and NLP, were significantly associated with increased risk of suicide. The SDOH with the largest effects was legal problems (aOR=2.67, 95% CI=2.46-2.89), followed by violence (aOR=2.26, 95% CI=2.11-2.43). NLP-extracted and structured SDOH were also associated with suicide. Conclusions and Relevance: NLP-extracted SDOH were always significantly associated with increased risk of suicide among Veterans, suggesting the potential of NLP in public health studies.
翻译:重要之处:已知健康的社会决定因素(SDOH)与自杀行为的风险增加有关,但很少有研究利用未经结构化的电子健康记录(EHR)中的SDOH。目标:调查自杀事件与最近的SDOH之间的关联,使用结构化和无结构化的数据进行确认。设计:内嵌的病例控制研究。设置:美国退伍军人健康管理局(VHA)提供的EHR数据。参与者:2010年10月1日至2015年9月30日期间在美国健康管理局接受护理的6,122,785名退伍军人(SDOH)与无结构化电子健康记录有明显的联系,但接触:SDOH在最长的两年时间里发生,而没有SDOH记录。主要结果和措施:自杀死亡案例与出生年、组开始日期、性以及后续跟踪的4,4,728人(SDOH=ODA)中,有结构化数据、不结构化数据的NLP(SDOHA)分别有7,8和9个男性健康(SDOA),根据SD-OD(SD-SD) 203中的最低比率(SD) 和95(SDODODOD)的概率间隔期增加。