As an application of Natural Language Processing (NLP) techniques, financial sentiment analysis (FSA) has become an invaluable tool for investors. Its speed and accuracy can significantly impact the returns of trading strategies.With the development of deep learning and Transformer-based pre-trained models like BERT, the accuracy of FSA has been much improved, but these time-consuming big models will also slow down the computation. To boost the processing speed of the FSA system and ensure high precision, we first propose an efficient and lightweight BERT (ELBERT) along with a novel confidence-window-based (CWB) early exit mechanism. Based on ELBERT, an innovative method to accelerate text processing on the GPU platform is developed, solving the difficult problem of making the early exit mechanism work more effectively with a large input batch size. Afterward, a fast and high-accuracy FSA system is built. Experimental results show that the proposed CWB early exit mechanism achieves significantly higher accuracy than existing early exit methods on BERT under the same computation cost. Besides, our FSA system can boost the processing speed to over 1000 texts per second with sufficient accuracy by using this acceleration method, which is nearly twice as fast as the FastBERT. Hence, this system can enable modern trading systems to quickly and accurately process financial text data.
翻译:作为自然语言处理(NLP)技术的应用,金融情绪分析(FSA)已成为投资者的宝贵工具,其速度和准确性可以极大地影响贸易战略的回报。随着深入学习和基于变异的预培训模型(如BERT)的开发,FSA的准确性已经大大提高,但这些耗时的大型模型也将减缓计算速度。为了提高FSA系统的处理速度并确保高精确度,我们首先提出一个高效和轻量级的BERT(ELBERT),以及一个新的基于信任的窗口(CWB)早期退出机制。在ELBERT的基础上,开发了加速GPU平台文本处理的创新方法,用大批量投入解决使早期退出机制更加有效的难题。之后,将建立一个快速和高精确的FSA系统。实验结果表明,拟议的CWB早期退出机制在计算成本下比BERT现有的早期退出方法(EB)实现显著的准确性更高的准确性。此外,我们的FSA系统可以将加速处理速度提高到1000多至每1,000份文本,而快速地将数据作为快速进行。