Shallow parsing is an essential task for many NLP applications like machine translation, summarization, sentiment analysis, aspect identification and many more. Quality annotated corpora is critical for building accurate shallow parsers. Many Indian languages are resource poor with respect to the availability of corpora in general. So, this paper is an attempt towards creating quality corpora for shallow parsers. The contribution of this paper is two folds: creation pos and chunk annotated corpora for Odia and development of baseline systems for pos tagging and chunking in Odia.
翻译:浅浅剖析是许多国家实验室方案应用的基本任务,如机器翻译、总结、情绪分析、侧面识别等。 质量附加说明对于建立准确的浅面剖析器至关重要。 许多印度语言在一般公司可用性方面资源贫乏。 因此,本文试图为浅面剖析器创造优质剖析器。 本文的贡献是两个折叠:为奥迪亚创建浮雕和块状注解体,以及开发奥迪亚的标注和块状基线系统。