Medical automatic diagnosis aims to imitate human doctors in real-world diagnostic processes and to achieve accurate diagnoses by interacting with the patients. The task is formulated as a sequential decision-making problem with a series of symptom inquiring steps and the final diagnosis. Recent research has studied incorporating reinforcement learning for symptom inquiring and classification techniques for disease diagnosis, respectively. However, studies on efficiently and effectively combining the two procedures are still lacking. To address this issue, we devise an adaptive mechanism to align reinforcement learning and classification methods using distribution entropy as the medium. Additionally, we created a new dataset for patient simulation to address the lacking of large-scale evaluation benchmarks. The dataset is extracted from the MedlinePlus knowledge base and contains significantly more diseases and more comprehensive symptoms and examination information than existing datasets. Experimental evaluation shows that our method outperforms three current state-of-the-art methods on different datasets by achieving higher medical diagnosis accuracy with fewer inquiring turns.
翻译:医学自动诊断旨在模仿现实世界诊断过程中的人类医生,并通过与病人互动实现准确诊断; 任务是一个顺序决策问题,有一系列症状询问步骤和最终诊断; 最近的研究分别纳入了症状查询和疾病诊断分类技术的强化学习; 然而,关于这两个程序的高效和有效结合的研究仍然缺乏。 为解决这一问题,我们设计了一个适应机制,用分布酶作为媒介来协调强化学习和分类方法。 此外,我们为患者模拟建立了一个新的数据集,以解决缺乏大规模评估基准的问题。 数据集是从梅德利普斯知识库提取的,其中含有比现有数据集更多的疾病和更全面的症状及检查信息。 实验性评估表明,我们的方法通过提高医学诊断的准确性,减少调查的转机率,从而优于目前不同数据集的三种最先进的方法。