Causal machine learning is a field that focuses on using machine learning methods to tackle causality problems. Despite the recent progress in this field, there are still many unresolved challenges, including missing data, selection bias, unobserved confounders, etc., which are ubiquitous in the real world. Advances in any of the above areas can greatly reduce the gap between research and real-world impact. In this competition, we focus on two fundamental challenges of causal machine learning in the context of education using time-series data. The first is to identify the causal relationships between different constructs, where a construct is defined as the smallest element of learning. The second challenge is to predict the impact of learning one construct on the ability to answer questions on other constructs. Addressing these challenges will not only impact the causal ML community but also enable optimisation of students' knowledge acquisition, which can be deployed in a real edtech solution impacting millions of students. Participants will run these tasks in an idealised environment with synthetic data and a real-world scenario with evaluation data collected from a series of A/B tests. We expect participants to develop novel machine learning methodologies for causal discovery between different constructs and the impact estimation of learning one construct on other constructs, which should bring fundamental advances to causal ML.
翻译:原因机器学习是一个侧重于使用机器学习方法解决因果关系问题的领域。尽管最近在这一领域取得了一些进展,但仍有许多尚未解决的挑战,包括缺少数据、选择偏向、未观察到的困惑者等,这些在现实世界中普遍存在。上述任何领域的进展都可以大大缩小研究与现实世界影响之间的差距。在这场竞争中,我们集中关注在使用时间序列数据进行教育的背景下因果机器学习的两个基本挑战。首先,确定不同建筑之间的因果关系,其中将建筑定义为最小的学习要素。第二个挑战是预测学习一个建筑对回答其他建筑问题的能力的影响。应对这些挑战不仅将影响有因果关系的ML社区,而且还能够优化学生的知识获取,这可以在影响数百万学生的真正技术解决方案中部署。参与者将在一个理想的环境中运用合成数据,用从一系列A/B测试中收集的评价数据来应对现实世界情景。我们期望参与者开发创新的机器学习方法,用以构建不同因果因素的发现,在构建不同的M/B测试中构建一种基本结论之间,在构建一种因果关系的学习。