Women, visible minorities, and other socially disadvantaged groups continue to be underrepresented in STEM education. Understanding students' motivations for pursuing a STEM major, and the roles gender, nationality, parental education attainment, and socio-economic background play in shaping students' motivations can support the design of more effective recruitment efforts towards these groups. In this paper, we propose and develop a novel text mining approach incorporating the Latent Dirichlet Allocation and word embeddings to analyze applicants' motivational factors to choosing an engineering program. We apply the proposed method to a data set of over 40,000 applications to the engineering school of a large Canadian university. We then investigate the relationship between applicants' gender, nationality, family income, and educational attainment, and their stated motivations for applying to their engineering program of choice. We find that interest in technology and the desire to make social impact are the two most powerful motivators for applicants. Additionally, while we find significant motivational differences related to applicants' nationality and family socio-economic status, gender differences are isolated from the effects of these factors.
翻译:在STEM教育中,妇女、有色人种和其他社会处境不利群体的代表性仍然不足。了解学生追求STEM大修的动机,以及性别、国籍、父母教育成就和社会经济背景在影响学生动机方面的作用,可以支持设计更为有效的招生工作。在本文件中,我们提出并发展了一种新颖的文字采矿方法,其中纳入了Leent Dirichlet分配和文字嵌入,以分析申请人选择工程方案的动机因素。我们将所提议的方法应用于一个大型加拿大大学工程学校的40,000多份申请数据组。我们然后调查申请人的性别、国籍、家庭收入和教育成就之间的关系,以及申请其工程选择方案的公开动机。我们发现,对技术和产生社会影响的愿望是申请人的两个最有力的动力动力。此外,虽然我们发现与申请人的国籍和家庭社会经济地位有关的重大动机差异,但性别差异与这些因素的影响是孤立的。