Graphs are commonly used to represent and visualize causal relations. For a small number of variables, this approach provides a succinct and clear view of the scenario at hand. As the number of variables under study increases, the graphical approach may become impractical, and the clarity of the representation is lost. Clustering of variables is a natural way to reduce the size of the causal diagram, but it may erroneously change the essential properties of the causal relations if implemented arbitrarily. We define a specific type of cluster, called transit cluster, that is guaranteed to preserve the identifiability properties of causal effects under certain conditions. We provide a sound and complete algorithm for finding all transit clusters in a given graph and demonstrate how clustering can simplify the identification of causal effects. We also study the inverse problem, where one starts with a clustered graph and looks for extended graphs where the identifiability properties of causal effects remain unchanged. We show that this kind of structural robustness is closely related to transit clusters.
翻译:图表通常用来代表因果关系并直观地描述因果关系。 对于少数变量, 这种方法对手头的假设情况提供了简洁和清晰的描述。 随着正在研究的变量数量的增加, 图形方法可能变得不切实际, 表达方式的清晰度也丧失了。 组合变量是减少因果图表大小的自然方式, 但是如果任意执行, 可能会错误地改变因果关系的基本特性 。 我们定义了特定类型的集群, 称为过境集群, 保证在某些条件下保存因果的可识别性。 我们为在特定图表中查找所有过境集群提供了合理和完整的算法, 并演示集群如何简化因果效应的识别。 我们还研究反向问题, 其中先用组合图开始, 寻找延长的图表, 其因果关系的可识别性保持不变 。 我们表明, 这种结构坚固性与过境集群密切相关 。