ICD自动编码多式机器学习 (Multimodal Machine Learning for Automated ICD Coding) - 专知论文

会员服务 ·

0

微F1 · 多模态机器学习 · Learning · 多峰值 · Automator ·

2022 年 9 月 1 日

Multimodal Machine Learning for Automated ICD Coding

翻译：ICD自动编码多式机器学习

Keyang Xu,Mike Lam,Jingzhi Pang,Xin Gao,Charlotte Band,Piyush Mathur,Frank Papay,Ashish K. Khanna,Jacek B. Cywinski,Kamal Maheshwari,Pengtao Xie,Eric Xing

from arxiv, Machine Learning for Healthcare 2019

This study presents a multimodal machine learning model to predict ICD-10 diagnostic codes. We developed separate machine learning models that can handle data from different modalities, including unstructured text, semi-structured text and structured tabular data. We further employed an ensemble method to integrate all modality-specific models to generate ICD-10 codes. Key evidence was also extracted to make our prediction more convincing and explainable. We used the Medical Information Mart for Intensive Care III (MIMIC -III) dataset to validate our approach. For ICD code prediction, our best-performing model (micro-F1 = 0.7633, micro-AUC = 0.9541) significantly outperforms other baseline models including TF-IDF (micro-F1 = 0.6721, micro-AUC = 0.7879) and Text-CNN model (micro-F1 = 0.6569, micro-AUC = 0.9235). For interpretability, our approach achieves a Jaccard Similarity Coefficient (JSC) of 0.1806 on text data and 0.3105 on tabular data, where well-trained physicians achieve 0.2780 and 0.5002 respectively.

翻译：这项研究提供了一种多式机器学习模型,以预测ICD-10诊断代码。我们开发了单独的机器学习模型,可以处理不同模式的数据,包括无结构文本、半结构文本和结构化表格数据。我们进一步采用了一种混合方法,整合所有特定模式模型,以生成ICD-10代码。还提取了关键证据,使我们的预测更加可信和可以解释。我们使用“三号强化护理医疗信息网”(MIMIMIC-III)数据集来验证我们的方法。对于ICD代码预测,我们最优秀的模型(Mro-F1=0.7633,微型-AUC=0.9541)大大优于其他基线模型,包括TF-IDF(1M-F1=0.6721,微型-AUC=0.7879)和Text-CNN模型(Micro-F1=0.6569,微型-AUC=0.9235),为了解释性,我们的方法在文本数据上达到了0.1806和表式数据上0.305,受过良好训练的医生分别达到0.270和0.5802。

0

相关内容

微F1

机器学习损失函数概述，Loss Functions in Machine Learning

机器学习损失函数概述，Loss Functions in Machine Learning

专知会员服务

84+阅读 · 2022年3月19日

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

专知会员服务

16+阅读 · 2022年3月13日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

疏肝法对情绪调节不良MCI患者工作记忆影响的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

植物单宁影响布氏田鼠取食行为的神经机制

国家自然科学基金

0+阅读 · 2014年12月31日

PI3K-Akt-mTOR信号通路介导的自噬对脊髓损伤后神经元细胞凋亡的影响及其机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

肌萎缩侧索硬化临床亚型的认知心理学及神经影像学研究

国家自然科学基金

0+阅读 · 2013年12月31日

不同基因型（p53codon72）鼻咽癌细胞放射敏感性差异的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于氯通道差异性表达的Disulfiram-Cu靶向抗肿瘤作用及其机制

国家自然科学基金

0+阅读 · 2012年12月31日

大电导钙激活钾离子通道(BK)的结构与功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

水通道蛋白4调节帕金森病不同亚群多巴胺能神经元损伤易感性差异的研究

国家自然科学基金

0+阅读 · 2011年12月31日

双人交互协同动作的识别和监测的研究

国家自然科学基金

0+阅读 · 2011年12月31日

CyclinE/Cdk2相关蛋白Ankrd17在细胞周期调控中的功能研究

国家自然科学基金

0+阅读 · 2008年12月31日

Multimodal Representation Learning on Graphs

Arxiv

0+阅读 · 2022年10月18日

Representation Theory for Geometric Quantum Machine Learning

Arxiv

0+阅读 · 2022年10月14日

Can Language Representation Models Think in Bets?

Arxiv

0+阅读 · 2022年10月14日

Geometric multimodal representation learning

Arxiv

69+阅读 · 2022年9月7日

Automated Graph Machine Learning: Approaches, Libraries and Directions

Arxiv

20+阅读 · 2022年1月4日

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

Deep Learning for Learning Graph Representations

Arxiv

35+阅读 · 2020年1月2日

A Survey on Distributed Machine Learning

Arxiv

45+阅读 · 2019年12月20日

Multimodal Intelligence: Representation Learning, Information Fusion, and Applications

Arxiv

78+阅读 · 2019年11月10日

Multimodal Machine Learning: A Survey and Taxonomy

Arxiv

151+阅读 · 2017年8月1日

VIP会员

文章信息

相关主题

多模态机器学习

相关VIP内容

机器学习损失函数概述，Loss Functions in Machine Learning

机器学习损失函数概述，Loss Functions in Machine Learning

专知会员服务

84+阅读 · 2022年3月19日

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

专知会员服务

16+阅读 · 2022年3月13日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《反无人机：用于无人机探测与定位的多输入多输出雷达》最新69页

《生成式人工智能及其在防御性网络安全课程中的应用》

《全谱战争——从拓宽工具到思考不可思考之事》

《FPV武装无人机的战斗飞行艺术与科学》最新报告

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Multimodal Representation Learning on Graphs

Arxiv

0+阅读 · 2022年10月18日

Representation Theory for Geometric Quantum Machine Learning

Arxiv

0+阅读 · 2022年10月14日

Can Language Representation Models Think in Bets?

Arxiv

0+阅读 · 2022年10月14日

Geometric multimodal representation learning

Arxiv

69+阅读 · 2022年9月7日

Automated Graph Machine Learning: Approaches, Libraries and Directions

Arxiv

20+阅读 · 2022年1月4日

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

Deep Learning for Learning Graph Representations

Arxiv

35+阅读 · 2020年1月2日

A Survey on Distributed Machine Learning

Arxiv

45+阅读 · 2019年12月20日

Multimodal Intelligence: Representation Learning, Information Fusion, and Applications

Arxiv

78+阅读 · 2019年11月10日

Multimodal Machine Learning: A Survey and Taxonomy

Arxiv

151+阅读 · 2017年8月1日

相关基金

疏肝法对情绪调节不良MCI患者工作记忆影响的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

植物单宁影响布氏田鼠取食行为的神经机制

国家自然科学基金

0+阅读 · 2014年12月31日

PI3K-Akt-mTOR信号通路介导的自噬对脊髓损伤后神经元细胞凋亡的影响及其机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

肌萎缩侧索硬化临床亚型的认知心理学及神经影像学研究

国家自然科学基金

0+阅读 · 2013年12月31日

不同基因型（p53codon72）鼻咽癌细胞放射敏感性差异的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于氯通道差异性表达的Disulfiram-Cu靶向抗肿瘤作用及其机制

国家自然科学基金

0+阅读 · 2012年12月31日

大电导钙激活钾离子通道(BK)的结构与功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

水通道蛋白4调节帕金森病不同亚群多巴胺能神经元损伤易感性差异的研究

国家自然科学基金

0+阅读 · 2011年12月31日

双人交互协同动作的识别和监测的研究

国家自然科学基金

0+阅读 · 2011年12月31日

CyclinE/Cdk2相关蛋白Ankrd17在细胞周期调控中的功能研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员