Women, visible minorities, and other socially disadvantaged groups continue to be underrepresented in STEM education. Understanding students' motivations for pursuing a STEM major, and the roles gender, ethnicity, parental education attainment, and socio-economic background play in shaping students' motivations can support the design of more effective recruitment efforts towards these groups. In this paper, we propose and develop a novel text mining approach incorporating the Latent Dirichlet Allocation and word embeddings to extract and analyze applicants' motivational factors to choosing an engineering program. We apply the proposed method to a data set of over 40,000 applications to the engineering school of a large North American university. We then investigate the relationship between applicants' gender, nationality, family income, and educational attainment, and their stated motivations for applying to their engineering program of choice. We find that interest in technology and the desire to make social impacts are the two most powerful motivators for applicants. Additionally, while we find significant motivational differences related to applicants' nationality and family socioeconomic status, gender differences are isolated from the effects of these factors.
翻译:在STEM教育中,妇女、有色人种和其他社会处境不利群体的代表性仍然不足。了解学生追求STEM大修的动机,以及性别、族裔、父母教育程度和社会经济背景在影响学生动机方面扮演的角色,可以支持设计更有效的针对这些群体的招聘工作。在本文件中,我们提出并发展了一种新颖的文本采矿方法,其中纳入了“Leent Dirichlet分配”和“语言嵌入”等内容,以提取和分析申请人选择工程方案的动机因素。我们对北美一所大型大学工程学院的40,000多份申请数据集采用了拟议方法。我们随后调查了申请人的性别、国籍、家庭收入和教育成就之间的关系,以及申请其选择工程方案的动机。我们发现,对技术和产生社会影响的愿望是申请人的两个最有力的动力驱动器。此外,虽然我们发现与申请人的国籍和家庭社会经济地位有关的重大动机差异,但性别差异与这些因素的影响是孤立的。