Clinical machine learning is increasingly multimodal, with data collected in both structured tabular formats and unstructured forms such as free text. We propose a novel task of exploring fairness on a multimodal clinical dataset, adopting equalized odds for the downstream medical prediction tasks. To this end, we investigate a modality-agnostic fairness algorithm, equalized odds post-processing, and compare it to a text-specific fairness algorithm: debiased clinical word embeddings. Although debiased word embeddings do not explicitly target equalized odds for protected groups, we show that a text-specific approach to fairness may nonetheless achieve a good balance between performance and classical notions of fairness. We hope that our paper inspires future contributions at the critical intersection of clinical NLP and fairness. The full source code is available here: https://github.com/johntiger1/multimodal_fairness
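As a point of reference for the modality-agnostic baseline described above, the following is a minimal sketch of equalized odds post-processing, illustrated with fairlearn's ThresholdOptimizer on synthetic data; the data, classifier choice, and variable names are assumptions for illustration and are not taken from the paper's pipeline.

```python
# Sketch of equalized-odds post-processing (fairlearn's ThresholdOptimizer used
# as a generic stand-in). All data below is synthetic; column/variable names
# are hypothetical.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from fairlearn.postprocessing import ThresholdOptimizer
from fairlearn.metrics import equalized_odds_difference

rng = np.random.default_rng(0)
n = 2000
X = rng.normal(size=(n, 5))                 # synthetic tabular clinical features
group = rng.integers(0, 2, size=n)          # synthetic protected attribute
y = (X[:, 0] + 0.5 * group + rng.normal(scale=0.5, size=n) > 0).astype(int)

X_tr, X_te, y_tr, y_te, g_tr, g_te = train_test_split(
    X, y, group, test_size=0.3, random_state=0
)

# Any downstream classifier works here; the post-processing step only needs
# its scores, which is what makes the approach modality-agnostic.
base = LogisticRegression().fit(X_tr, y_tr)

# Learn group-specific decision thresholds so that true- and false-positive
# rates are (approximately) equalized across protected groups.
postproc = ThresholdOptimizer(
    estimator=base, constraints="equalized_odds", prefit=True
)
postproc.fit(X_tr, y_tr, sensitive_features=g_tr)
y_hat = postproc.predict(X_te, sensitive_features=g_te, random_state=0)

print("equalized-odds gap:",
      equalized_odds_difference(y_te, y_hat, sensitive_features=g_te))
```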