Medical coding is the task of assigning medical codes to clinical free-text documentation. Healthcare professionals manually assign such codes to track patient diagnoses and treatments, and automated medical coding could considerably alleviate this administrative burden. In this paper, we reproduce, compare, and analyze state-of-the-art automated medical coding machine learning models. We show that several models underperform due to weak configurations, poorly sampled train-test splits, and insufficient evaluation. In previous work, the macro F1 score was computed sub-optimally, and our correction doubles it. We contribute a revised model comparison based on stratified sampling and identical experimental setups, including hyperparameter and decision-boundary tuning. We analyze prediction errors to validate and falsify assumptions made in previous work. The analysis confirms that all models struggle with rare codes, while long documents have only a negligible impact. Finally, we present the first comprehensive results on the newly released MIMIC-IV dataset using the reproduced models. We release our code, model parameters, and new MIMIC-III and MIMIC-IV training and evaluation pipelines to facilitate fair future comparisons.
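To make the macro F1 claim concrete, the following is a minimal sketch of how averaging per-code F1 over the full label space can deflate the score when many codes never occur in the test set. The toy matrices and the assumption that the correction amounts to averaging only over codes present in the test split are illustrative, not the paper's code.

```python
import numpy as np
from sklearn.metrics import f1_score

# Hypothetical binary indicator matrices: rows = documents, columns = codes.
y_true = np.array([[1, 0, 0], [0, 1, 0], [1, 1, 0]])  # code 2 never occurs
y_pred = np.array([[1, 0, 0], [0, 1, 0], [1, 0, 0]])

# Naive macro F1: averages over every column, so codes absent from the
# test set contribute an F1 of 0 and drag the mean down.
naive = f1_score(y_true, y_pred, average="macro", zero_division=0)

# Corrected macro F1: average only over codes that occur in the test set.
present = y_true.sum(axis=0) > 0
corrected = f1_score(y_true[:, present], y_pred[:, present],
                     average="macro", zero_division=0)

print(f"naive: {naive:.3f}, corrected: {corrected:.3f}")
# naive: 0.556, corrected: 0.833 -- the gap grows with more absent codes.
```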
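The decision-boundary tuning mentioned above can likewise be illustrated. A hedged sketch, assuming it means sweeping a single probability threshold on validation data and keeping the one that maximizes micro F1; the function name, grid, and metric choice are hypothetical, not taken from the released pipelines.

```python
import numpy as np

def tune_threshold(probs, labels, grid=np.linspace(0.05, 0.95, 19)):
    """Pick the decision boundary that maximizes micro F1 on validation data.

    probs: (docs, codes) float array of model probabilities.
    labels: (docs, codes) binary ground-truth array.
    """
    best_t, best_f1 = 0.5, -1.0
    for t in grid:
        preds = probs >= t
        tp = np.logical_and(preds, labels.astype(bool)).sum()
        fp = np.logical_and(preds, ~labels.astype(bool)).sum()
        fn = np.logical_and(~preds, labels.astype(bool)).sum()
        f1 = 2 * tp / max(2 * tp + fp + fn, 1)  # micro F1 at threshold t
        if f1 > best_f1:
            best_t, best_f1 = t, f1
    return best_t

# Usage: tune on validation predictions, then apply the threshold at test time.
# t = tune_threshold(val_probs, val_labels); test_preds = test_probs >= t
```

Tuning the threshold per model, rather than fixing it at 0.5, removes one source of unfair comparison between models whose output probabilities are calibrated differently.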