Poverty is one of the fundamental issues that mankind faces. Multidimensional Poverty Index (MPI) is deployed for measuring poverty issues in a population beyond monetary. However, MPI cannot provide information regarding associations and causal relations among poverty factors. Does education cause income inequality in a specific region? Is lacking education a cause of health issues? By not knowing causal relations, policy maker cannot pinpoint root causes of poverty issues of a specific population, which might not be the same across different population. Additionally, MPI requires binary data, which cannot be analyzed by most of causal inference frameworks. In this work, we proposed an exploratory-data-analysis framework for finding possible causal relations with confidence intervals among binary data. The proposed framework provides not only how severe the issue of poverty is, but it also provides the causal relations among poverty factors. Moreover, knowing a confidence interval of degree of causal direction lets us know how strong a causal relation is. We evaluated the proposed framework with several baseline approaches in simulation datasets as well as using two real-world datasets as case studies 1) Twin births of the United States: the relation between birth weight and mortality of twin, and 2) Thailand population surveys from 378k households of Chiang Mai and 353k households of Khon Kaen provinces. Our framework performed better than baselines in most cases. The first case study reveals almost all mortality cases in twins have issues of low birth weights but not all low-birth-weight twins were died. The second case study reveals that smoking associates with drinking alcohol in both provinces and there is a causal relation of smoking causes drinking alcohol in only Chiang Mai province. The framework can be applied beyond the poverty context.
翻译:贫穷是人类面临的根本问题之一。 多层面贫穷指数(MPI)用于衡量人口在货币以外的贫困问题上的贫困问题。 但是, MPI无法提供有关协会和贫困因素之间因果关系的信息。 教育是否在特定区域造成收入不平等? 缺乏教育是否在特定区域造成健康问题? 缺乏教育是否是健康问题的原因之一? 政策制定者由于不了解因果关系,无法确定特定人口贫困问题的根源,而不同人口之间可能不同。 此外, MPI需要二进制数据,这些数据无法用大多数双胞胎双胞胎的因果关系框架加以分析。 在这项工作中,我们提议了一个探索性数据分析框架,以寻找可能与双胞胎之间信任度数据之间的因果关系。 拟议的框架不仅提供了贫困问题的严重性,而且还提供了贫困因素之间的因果关系。 此外,由于信任程度不同,政策制定者无法确定特定人口贫困问题的根源。 我们评估了拟议框架,模拟数据集中的若干基线方法,以及两个真实的双胞胎死亡率框架:(1) 美国的双胞胎:双胞胎和双胞胎死亡率之间的关系; 与泰国马氏河两省进行的大多数人口基准调查案例都显示, 。