Deterministic variables are variables that are fully explained by one or more parent variables. They commonly arise when a variable has been algebraically constructed from one or more parent variables, as with composite variables, and in compositional data, where the 'whole' variable is determined from its 'parts'. This article introduces how deterministic variables may be depicted within directed acyclic graphs (DAGs) to help with identifying and interpreting causal effects involving tautological associations, compositional data, and composite variables. We propose a two-step approach in which all variables are initially considered, and an explicit choice is then made whether to focus on the deterministic variable(s) or the determining parents. Depicting deterministic variables within DAGs bring several benefits. It is easier to identify and avoid misinterpreting tautological associations, i.e., self-fulfilling associations between variables with shared algebraic parent variables. In compositional data, it is easier to understand the consequences of conditioning on the 'whole' variable, and correctly identify total and relative causal effects. For composite variables, it encourages greater consideration of the target estimand and greater scrutiny of the consistency and exchangeability assumptions. DAGs with deterministic variables are a useful aid for planning and interpreting analyses involving tautological associations, compositional data, and/or composite variables.
翻译:确定性变量是由父变量或母变量充分解释的变量。当变量从一个或多个父变量(如复合变量)和组成数据(如复合变量)对变量进行代数构建时,通常会出现这些变量。当“全”变量从其“parts”中确定时,该文章介绍了如何在定向循环图(DAGs)中描述确定性变量,以帮助识别和解释由同位父变量、组成数据和复合变量组成的因果关系。我们提出了一个两步方法,即最初考虑所有变量,然后作出明确的选择,即侧重于确定性变量或确定性父母。在DAGs中淡化“全”变量带来若干好处。本文章介绍了如何在定向循环图(DAGs)中识别和避免曲解曲性变异,即自补变量与同位父变量之间的关联。在组成数据中,更容易理解调整“全”变量的后果,并正确识别总和相对因果关系。对于复合变量而言,鼓励更多地考虑目标变量的变数和变数分析,在分析中进行更精确性分析。