利用行政数据进行比阿斯审计:评估COVID-19政策与流动数据不一致的覆盖面 (Leveraging Administrative Data for Bias Audits: Assessing Disparate Coverage with Mobility Data for COVID-19 Policy) - 专知论文

会员服务 ·

0

有偏 · COVID-19 · 真实值 · INFORMS · Less ·

2021 年 4 月 16 日

Leveraging Administrative Data for Bias Audits: Assessing Disparate Coverage with Mobility Data for COVID-19 Policy

翻译：利用行政数据进行比阿斯审计:评估COVID-19政策与流动数据不一致的覆盖面

Amanda Coston,Neel Guha,Derek Ouyang,Lisa Lu,Alexandra Chouldechova,Daniel E. Ho

Anonymized smartphone-based mobility data has been widely adopted in devising and evaluating COVID-19 response strategies such as the targeting of public health resources. Yet little attention has been paid to measurement validity and demographic bias, due in part to the lack of documentation about which users are represented as well as the challenge of obtaining ground truth data on unique visits and demographics. We illustrate how linking large-scale administrative data can enable auditing mobility data for bias in the absence of demographic information and ground truth labels. More precisely, we show that linking voter roll data -- containing individual-level voter turnout for specific voting locations along with race and age -- can facilitate the construction of rigorous bias and reliability tests. These tests illuminate a sampling bias that is particularly noteworthy in the pandemic context: older and non-white voters are less likely to be captured by mobility data. We show that allocating public health resources based on such mobility data could disproportionately harm high-risk elderly and minority groups.

翻译：在制定和评价COVID-19应对战略(如针对公共卫生资源)时,广泛采用了匿名智能手机流动数据,但很少注意衡量有效性和人口偏差,部分原因是缺乏关于用户代表的文件,以及难以获得关于独特访问和人口统计的地面真相数据。我们说明大规模行政数据如何可以将流动数据与缺乏人口信息和地面真相标签的偏差进行审计。更确切地说,我们表明,将选民名册数据 -- -- 包括特定投票地点的个人投票率以及种族和年龄 -- -- 联系起来,有助于构建严格的偏差和可靠性测试。这些测试揭示了在大流行病背景下特别值得注意的抽样偏差:流动数据不太可能捕捉到老年和非白人选民。我们表明,根据这种流动数据分配公共卫生资源可能会对高风险老年人和少数群体造成极大伤害。

0

相关内容

人工智能如何用于抵抗COVID-19？Mila这份《AI against COVID-19 》PPT

专知会员服务

48+阅读 · 2020年5月17日

Python分布式计算，171页pdf，Distributed Computing with Python

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

【新开放书】医学影像原理与应用，Medical Imaging Principles and Applications

【新开放书】医学影像原理与应用，Medical Imaging Principles and Applications

专知会员服务

90+阅读 · 2019年12月15日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【VLDB2019 tutorial】Combating Fake News: A Data Management and Mining Perspective，不列颠哥伦比亚大|Laks V.S. Lakshmanan，Michael Simpson，Sara Thirumuruganathan，156页PDF

【VLDB2019 tutorial】Combating Fake News: A Data Management and Mining Perspective，不列颠哥伦比亚大|Laks V.S. Lakshmanan，Michael Simpson，Sara Thirumuruganathan，156页PDF

专知会员服务

13+阅读 · 2019年8月27日

【VLDB2019 tutorial】数据管理：机遇与挑战 Data Lake Management: Challenges and Opportunities，多伦多大学|Fatemeh Nargesian，微软|祝尔康

【VLDB2019 tutorial】数据管理：机遇与挑战 Data Lake Management: Challenges and Opportunities，多伦多大学|Fatemeh Nargesian，微软|祝尔康

专知会员服务

10+阅读 · 2019年8月27日

【电子书推荐】Data Science with Python and Dask

【电子书推荐】Data Science with Python and Dask

专知会员服务

44+阅读 · 2019年6月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无人机视觉挑战赛 | ICCV 2019 Workshop—VisDrone2019

无人机视觉挑战赛 | ICCV 2019 Workshop—VisDrone2019

PaperWeekly

7+阅读 · 2019年5月5日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

【TED】什么让我们生病

【TED】什么让我们生病

英语演讲视频每日一推

7+阅读 · 2019年1月23日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

【推荐】Kaggle机器学习数据集推荐

【推荐】Kaggle机器学习数据集推荐

机器学习研究会

8+阅读 · 2017年11月19日

Understanding Privacy Attitudes and Concerns Towards Remote Communications During the COVID-19 Pandemic

Arxiv

0+阅读 · 2021年6月9日

Geospatial Reasoning with Shapefiles for Supporting Policy Decisions

Arxiv

0+阅读 · 2021年6月9日

Investigating label suggestions for opinion mining in German Covid-19 social media

Arxiv

0+阅读 · 2021年6月8日

A Distance Covariance-based Kernel for Nonlinear Causal Clustering in Heterogeneous Populations

Arxiv

0+阅读 · 2021年6月7日

Effect of Adaptive and Fixed Shared Steering Control on Distracted Driver Behavior

Arxiv

0+阅读 · 2021年6月7日

Estimating the number of entities with vacancies using administrative and online data

Arxiv

0+阅读 · 2021年6月6日

Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage

Arxiv

0+阅读 · 2021年6月6日

Simulation Framework for Realistic Large-scale Individual-level Data Generation with an Application in the Health Domain

Arxiv

0+阅读 · 2021年6月5日

Impact of data-splits on generalization: Identifying COVID-19 from cough and context

Arxiv

0+阅读 · 2021年6月5日

On Feature Normalization and Data Augmentation

On Feature Normalization and Data Augmentation

Arxiv

15+阅读 · 2020年2月25日

VIP会员

文章信息

相关主题

相关VIP内容

人工智能如何用于抵抗COVID-19？Mila这份《AI against COVID-19 》PPT

专知会员服务

48+阅读 · 2020年5月17日

Python分布式计算，171页pdf，Distributed Computing with Python

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

【新开放书】医学影像原理与应用，Medical Imaging Principles and Applications

【新开放书】医学影像原理与应用，Medical Imaging Principles and Applications

专知会员服务

90+阅读 · 2019年12月15日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【VLDB2019 tutorial】Combating Fake News: A Data Management and Mining Perspective，不列颠哥伦比亚大|Laks V.S. Lakshmanan，Michael Simpson，Sara Thirumuruganathan，156页PDF

【VLDB2019 tutorial】Combating Fake News: A Data Management and Mining Perspective，不列颠哥伦比亚大|Laks V.S. Lakshmanan，Michael Simpson，Sara Thirumuruganathan，156页PDF

专知会员服务

13+阅读 · 2019年8月27日

【VLDB2019 tutorial】数据管理：机遇与挑战 Data Lake Management: Challenges and Opportunities，多伦多大学|Fatemeh Nargesian，微软|祝尔康

【VLDB2019 tutorial】数据管理：机遇与挑战 Data Lake Management: Challenges and Opportunities，多伦多大学|Fatemeh Nargesian，微软|祝尔康

专知会员服务

10+阅读 · 2019年8月27日

【电子书推荐】Data Science with Python and Dask

【电子书推荐】Data Science with Python and Dask

专知会员服务

44+阅读 · 2019年6月1日

热门VIP内容

开通专知VIP会员享更多权益服务

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无人机视觉挑战赛 | ICCV 2019 Workshop—VisDrone2019

无人机视觉挑战赛 | ICCV 2019 Workshop—VisDrone2019

PaperWeekly

7+阅读 · 2019年5月5日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

【TED】什么让我们生病

【TED】什么让我们生病

英语演讲视频每日一推

7+阅读 · 2019年1月23日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

【推荐】Kaggle机器学习数据集推荐

【推荐】Kaggle机器学习数据集推荐

机器学习研究会

8+阅读 · 2017年11月19日

相关论文

Understanding Privacy Attitudes and Concerns Towards Remote Communications During the COVID-19 Pandemic

Arxiv

0+阅读 · 2021年6月9日

Geospatial Reasoning with Shapefiles for Supporting Policy Decisions

Arxiv

0+阅读 · 2021年6月9日

Investigating label suggestions for opinion mining in German Covid-19 social media

Arxiv

0+阅读 · 2021年6月8日

A Distance Covariance-based Kernel for Nonlinear Causal Clustering in Heterogeneous Populations

Arxiv

0+阅读 · 2021年6月7日

Effect of Adaptive and Fixed Shared Steering Control on Distracted Driver Behavior

Arxiv

0+阅读 · 2021年6月7日

Estimating the number of entities with vacancies using administrative and online data

Arxiv

0+阅读 · 2021年6月6日

Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage

Arxiv

0+阅读 · 2021年6月6日

Simulation Framework for Realistic Large-scale Individual-level Data Generation with an Application in the Health Domain

Arxiv

0+阅读 · 2021年6月5日

Impact of data-splits on generalization: Identifying COVID-19 from cough and context

Arxiv

0+阅读 · 2021年6月5日

On Feature Normalization and Data Augmentation

On Feature Normalization and Data Augmentation

Arxiv

15+阅读 · 2020年2月25日

微信扫码咨询专知VIP会员