In the field of car evaluation, more and more netizens choose to express their opinions on the Internet platform, and these comments will affect the decision-making of buyers and the trend of car word-of-mouth. As an important branch of natural language processing (NLP), sentiment analysis provides an effective research method for analyzing the sentiment types of massive car review texts. However, due to the lexical professionalism and large text noise of review texts in the automotive field, when a general sentiment analysis model is applied to car reviews, the accuracy of the model will be poor. To overcome these above challenges, we aim at the sentiment analysis task of car review texts. From the perspective of word vectors, pre-training is carried out by means of whole word mask of proprietary vocabulary in the automotive field, and then training data is carried out through the strategy of an adversarial training set. Based on this, we propose a car review text sentiment analysis model based on adversarial training and whole word mask BERT(ATWWM-BERT).
翻译:在汽车评估领域,越来越多的网民选择在互联网平台上表达自己的意见,这些评论将影响买家的决策和汽车嘴单的倾向。作为自然语言处理的一个重要分支,情绪分析为分析大规模汽车审查文本的情绪类型提供了有效的研究方法。然而,由于汽车领域的词汇专业精神和审查文本的大量文字噪音,当汽车领域应用一般情绪分析模型来进行汽车审查时,模型的准确性将很低。为了克服上述挑战,我们的目标是汽车审查文本的情绪分析任务。从文字载体的角度来看,培训前用汽车领域的专有词汇的全字面罩进行,然后通过一套对抗性培训战略进行培训数据。在此基础上,我们提出一个基于对抗性培训和全字面遮罩BERT(ATWM-BERT)的汽车审查文本情绪分析模型。