In pro-drop language like Arabic, Chinese, Italian, Japanese, Spanish, and many others, unrealized (null) arguments in certain syntactic positions can refer to a previously introduced entity, and are thus called anaphoric zero pronouns. The existing resources for studying anaphoric zero pronoun interpretation are however still limited. In this paper, we use five data augmentation methods to generate and detect anaphoric zero pronouns automatically. We use the augmented data as additional training materials for two anaphoric zero pronoun systems for Arabic. Our experimental results show that data augmentation improves the performance of the two systems, surpassing the state-of-the-art results.
翻译:在诸如阿拉伯文、中文、意大利文、日文、西班牙文和其他许多赞成滴答的语言中,某些合成位置的未实现(null)论点可以指一个以前引入的实体,因此被称为无无代名词。然而,研究无代名词无无代名解释的现有资源仍然有限。在本文中,我们使用五个数据增强方法自动生成和检测无代名词无代名词。我们用扩大的数据作为两个阿拉伯文无代名无代名系统的额外培训材料。我们的实验结果显示,数据增加可以改善两个系统的性能,超过最新结果。