Methods and Materials: We investigated transferability of neural network-based de-identification sys-tems with and without domain generalization. We used two domain generalization approaches: a novel approach Joint-Domain Learning (JDL) as developed in this paper, and a state-of-the-art domain general-ization approach Common-Specific Decomposition (CSD) from the literature. First, we measured trans-ferability from a single external source. Second, we used two external sources and evaluated whether domain generalization can improve transferability of de-identification models across domains which rep-resent different note types from the same institution. Third, using two external sources with in-domain training data, we studied whether external source data are useful even in cases where sufficient in-domain training data are available. Finally, we investigated transferability of the de-identification mod-els across institutions. Results and Conclusions: We found transferability from a single external source gave inconsistent re-sults. Using additional external sources consistently yielded an F1-score of approximately 80%, but domain generalization was not always helpful to improve transferability. We also found that external sources were useful even in cases where in-domain training data were available by reducing the amount of needed in-domain training data or by improving performance. Transferability across institutions was differed by note type and annotation label. External sources from a different institution were also useful to further improve performance.
翻译:方法与材料:我们调查了基于神经网络的去识别系统系统的可转让性;我们调查了基于神经网络的去识别系统系统的可转让性;我们使用两种领域通用方法:本文中制定的新颖方法 " 联合学习 " (JDL),以及文献中采用的最新领域通用方法 " 共同具体分解(CSD) " ;首先,我们从单一外部来源测量了从单一外部来源的可转让性;第二,我们使用两个外部来源,并评价了域通用性是否能改进在同一机构代表不同批注类型的不同领域的去识别模型的可转让性。第三,我们使用两个具有内部培训数据的外部来源,我们研究了外部来源是否有用,即便在具备足够的内部培训数据的情况下,也研究了外部来源是否有用;通过不同类型培训,我们发现外部来源的可转让性得到了改进。