Extraction of sentiment signals from news text, stock message boards, and business reports, for stock movement prediction, has been a rising field of interest in finance. Building upon past literature, the most recent works attempt to better capture sentiment from sentences with complex syntactic structures by introducing aspect-level sentiment classification (ASC). Despite the growing interest, however, fine-grained sentiment analysis has not been fully explored in non-English literature due to the shortage of annotated finance-specific data. Accordingly, it is necessary for non-English languages to leverage datasets and pre-trained language models (PLM) of different domains, languages, and tasks to best their performance. To facilitate finance-specific ASC research in the Korean language, we build KorFinASC, a Korean aspect-level sentiment classification dataset for finance consisting of 12,613 human-annotated samples, and explore methods of intermediate transfer learning. Our experiments indicate that past research has been ignorant towards the potentially wrong knowledge of financial entities encoded during the training phase, which has overestimated the predictive power of PLMs. In our work, we use the term "non-stationary knowledge'' to refer to information that was previously correct but is likely to change, and present "TGT-Masking'', a novel masking pattern to restrict PLMs from speculating knowledge of the kind. Finally, through a series of transfer learning with TGT-Masking applied we improve 22.63% of classification accuracy compared to standalone models on KorFinASC.
翻译:从新闻文本、股票电文板和商业报告中提取的情绪信号,对于股票流动预测来说,是一个对金融越来越感兴趣的领域。根据以往的文献,最近的一项工作试图通过引入方方面面情绪分类(ASC),从复杂的合成结构中更好地捕捉感知感知。然而,尽管由于缺少附加说明的专项财务数据,非英文文献对微量情绪分析没有进行充分探讨。因此,非英语语言有必要利用不同领域、语言和任务的培训前语言模型(PLM)的数据集和预培训语言模型(PLM),以达到最佳性能。根据以往的文献,我们试图通过引入方方面面情绪分类(ASC),通过引入方方面面情绪分类(ASC)来更好地捕捉感知感。尽管由于缺少附加说明的样本,非英文文献中并未充分探索微量的情绪分析。因此,非英语语言有必要利用不同领域、语言、语言和预先培训语言的语文模型(PLMS)的预测力。在我们的工作中,我们使用“非固定的ASC精度(MA-ML)分类的精确性知识分类(T)“最终修正”是指学习“TLAILA”的模型,最终的规律,从学习“OILA的规律到“R”到“升级”到“ODR(M),最终的规律,从学习的规律,从学习“ODLV的规律学”到“O的规律学”到“O的规律。