Violence risk assessment in psychiatric institutions enables interventions to avoid violence incidents. Clinical notes written by practitioners and available in electronic health records are valuable resources capturing unique information, but are seldom used to their full potential. We explore conventional and deep machine learning methods to assess violence risk in psychiatric patients using practitioner notes. The performance of our best models is comparable to the currently used questionnaire-based method, with an area under the Receiver Operating Characteristic curve of approximately 0.8. We find that the deep-learning model BERTje performs worse than conventional machine learning methods. We also evaluate our data and our classifiers to understand the performance of our models better. This is particularly important for the applicability of evaluated classifiers to new data, and is also of great interest to practitioners, due to the increased availability of new data in electronic format.
翻译:精神病机构的暴力风险评估有助于采取干预措施,避免暴力事件的发生。从业者编写并在电子健康记录中提供的临床说明是收集独特信息的宝贵资源,但很少充分利用其潜力。我们探索常规和深层次的机器学习方法,利用从业者说明评估精神病患者的暴力风险。我们的最佳模型的性能可与目前使用的基于问卷的方法相比,其范围约为0.8。我们发现深层次学习的BERTje模型比常规机器学习方法要差。我们还评估我们的数据和分类人员,以便更好地了解我们模型的性能。这对于经过评估的分类人员对新数据的适用性特别重要,而且由于电子格式的新数据越来越多,对从业者也非常感兴趣。