In this paper, we give an overview of the shared task at the CCF Conference on Natural Language Processing \& Chinese Computing (NLPCC 2017): Chinese News Headline Categorization. The dataset for this shared task comprises 18 classes, with 12,000 short texts and their corresponding labels for each class. The dataset and example code can be accessed at https://github.com/FudanNLP/nlpcc2017_news_headline_categorization.