Data collection and research methodology represents a critical part of the research pipeline. On the one hand, it is important that we collect data in a way that maximises the validity of what we are measuring, which may involve the use of long scales with many items. On the other hand, collecting a large number of items across multiple scales results in participant fatigue, and expensive and time consuming data collection. It is therefore important that we use the available resources optimally. In this work, we consider how a consideration for theory and the associated causal/structural model can help us to streamline data collection procedures by not wasting time collecting data for variables which are not causally critical for subsequent analysis. This not only saves time and enables us to redirect resources to attend to other variables which are more important, but also increases research transparency and the reliability of theory testing. In order to achieve this streamlined data collection, we leverage structural models, and Markov conditional independency structures implicit in these models to identify the substructures which are critical for answering a particular research question. In this work, we review the relevant concepts and present a number of didactic examples with the hope that psychologists can use these techniques to streamline their data collection process without invalidating the subsequent analysis. We provide a number of simulation results to demonstrate the limited analytical impact of this streamlining.
翻译:数据收集和研究方法是研究管道的一个关键部分。一方面,我们收集数据的方式必须最大限度地提高我们所测量的变量的可靠性,这可能涉及使用许多物品的长尺度。另一方面,收集多种规模的大量项目,结果导致参与者疲劳、昂贵和耗时的数据收集。因此,我们必须以最佳方式利用现有的资源。在这项工作中,我们考虑考虑理论和相关的因果/结构模型如何帮助我们简化数据收集程序,不浪费时间收集对随后分析不具有因果重要性的变量的数据。这不仅节省时间,使我们能够将资源转用于更重要的其他变量,而且还能提高研究透明度和理论测试的可靠性。为了实现这一简化的数据收集,我们利用结构模型,并以这些模型所隐含的有条件依赖结构来确定对回答特定研究问题至关重要的亚结构。在这项工作中,我们审查相关概念并提出若干细微例子,希望心理学家们能够利用这些分析方法来简化其随后的分析工作结果,但又不要求简化这些分析结果。