This survey delves into the current state of natural language processing (NLP) for four Ethiopian languages: Amharic, Afaan Oromo, Tigrinya, and Wolaytta. Through this paper, we identify key challenges and opportunities for NLP research in Ethiopia. Furthermore, we provide a centralized repository on GitHub that contains publicly available resources for various NLP tasks in these languages. This repository can be updated periodically with contributions from other researchers. Our objective is to identify research gaps and disseminate the information to NLP researchers interested in Ethiopian languages and encourage future research in this domain.
翻译:本次调研深入探讨了埃塞俄比亚四种语言(阿姆哈拉语、奥罗莫语、提格丽尼亚语和乌拉依特语)现有的自然语言处理(NLP)进展。通过本文,我们识别了埃塞俄比亚NLP研究的关键挑战和机遇。此外,我们提供一个在GitHub上的集中存储库,其中包含了这些语言的多种NLP任务的公共资源。其他研究人员还可通过贡献来定期更新该存储库。我们的目标是找到研究空白,并将信息分发给对埃塞俄比亚语言感兴趣的NLP研究人员,以鼓励未来的研究。