We present the Radar de Parit\'e, an automated Natural Language Processing (NLP) system that measures the proportion of women and men quoted daily in six Canadian French-language media outlets. We outline the system's architecture and detail the challenges we overcame to address French-specific issues, in particular regarding coreference resolution, a new contribution to the NLP literature on French. We also showcase statistics covering over one year's worth of data (282,512 news articles). Our results highlight the underrepresentation of women in news stories, while also illustrating the application of modern NLP methods to measure gender representation and address societal issues.
翻译:我们提出了Radar de Parité,这是一个自动化的自然语言处理系统,可测量六个加拿大法语媒体机构每日引用的女性和男性比例。我们概述了该系统的架构,并详细说明了我们克服的挑战,特别是有关指代消解的法语特定问题,这是一项针对法语的自然语言处理文献的新贡献。我们还展示了超过一年数据(282,512篇新闻文章)的统计数据。我们的结果突显了新闻报道中女性的低代表性,同时也说明了应用现代自然语言处理方法来测量性别代表性并解决社会问题的能力。