International Classification of Diseases (ICD) is a set of classification codes for medical records. Automated ICD coding, which assigns unique International Classification of Diseases codes with each medical record, is widely used recently for its efficiency and error-prone avoidance. However, there are challenges that remain such as heterogeneity, label unbalance, and complex relationships between ICD codes. In this work, we proposed a novel Bidirectional Hierarchy Framework(HieNet) to address the challenges. Specifically, a personalized PageRank routine is developed to capture the co-relation of codes, a bidirectional hierarchy passage encoder to capture the codes' hierarchical representations, and a progressive predicting method is then proposed to narrow down the semantic searching space of prediction. We validate our method on two widely used datasets. Experimental results on two authoritative public datasets demonstrate that our proposed method boosts state-of-the-art performance by a large margin.
翻译:国际疾病分类(疾病分类)是一套医疗记录的分类编码。自动的ICD编码为每一医疗记录指定了独特的国际疾病分类编码,最近广泛用于提高效率和避免发生错误,然而,仍然存在一些挑战,例如异质性、标签不平衡和疾病分类编码之间的复杂关系。在这项工作中,我们提出了一个新的双向分级框架(HieNet)以应对挑战。具体地说,正在开发一个个性化的PageRank常规,以捕捉各种编码的共通关系,一种双向分级分级连接码,以捕捉编码的分级表示,然后提出一种渐进的预测方法以缩小预测的语义搜索空间。我们在两个广泛使用的数据集上验证了我们的方法。两个权威性公共数据集的实验结果表明,我们所提议的方法能大大提升最新水平的表现。