Graphs are commonly used to represent and visualize causal relations. For a small number of variables, this approach provides a succinct and clear view of the scenario at hand. As the number of variables under study increases, the graphical approach may become impractical, and the clarity of the representation is lost. Clustering of variables is a natural way to reduce the size of the causal diagram but it may erroneously change the essential properties of the causal relations if implemented arbitrarily. We define a specific type of cluster, called transit cluster, that is guaranteed to preserve the identifiability properties of causal effects under certain conditions. We provide a sound and complete algorithm for finding all transit clusters in a given graph and demonstrate how clustering can simplify the identification of causal effects. We also study the inverse problem, where one starts with a clustered graph and looks for extended graphs where the identifiability properties of causal effects remain unchanged. We show that this kind of structural robustness is closely related to transit clusters.
翻译:图表通常用来代表因果关系并直观地描述因果关系。 对于少数变量,这个方法提供了手头情景的简洁和清晰的视角。随着研究中的变量数量的增加,图形方法可能变得不切实际,其代表性也丧失了。变量的分组是一种自然方式,可以减少因果图表的大小,但如果任意执行,则可能会错误地改变因果关系的基本属性。我们定义了特定类型的集群,称为过境集群,保证在某些条件下保存因果关系的可识别性。我们提供了一种合理和完整的算法,用于在特定图表中查找所有过境集群,并演示集群如何简化因果效应的识别。我们还研究了反向问题,先用组合图开始,然后在因果关系的可识别性保持不变的情况下寻找扩展的图表。我们表明,这种结构稳健性与过境集群密切相关。