In the current stock market, computer science and technology are more and more widely used to analyse stocks. Not same as most related machine learning stock price prediction work, this work study the predicting the tendency of the stock price on the second day right after the disclosure of the companies' annual reports. We use a variety of different models, including decision tree, logistic regression, random forest, neural network, prototypical networks. We use two sets of financial indicators (key and expanded) to conduct experiments, these financial indicators are obtained from the EastMoney website disclosed by companies, and finally we find that these models are not well behaved to predict the tendency. In addition, we also filter stocks with ROE greater than 0.15 and net cash ratio greater than 0.9. We conclude that according to the financial indicators based on the just-released annual report of the company, the predictability of the stock price movement on the second day after disclosure is weak, with maximum accuracy about 59.6% and maximum precision about 0.56 on our test set by the random forest classifier, and the stock filtering does not improve the performance. And random forests perform best in general among all these models which conforms to some work's findings.
翻译:在目前的股票市场中,计算机科学和技术越来越多地被更广泛地用于分析股票。与大多数相关的机器学习股票价格预测工作不同,这项工作研究的是预测股票价格的趋势,在公司年度报告披露后的第二天,即公司年度报告披露后第二天。我们使用各种不同的模型,包括决策树、物流回归、随机森林、神经网络、原型网络。我们使用两套金融指标(钥匙和扩大)来进行实验,这些金融指标是从公司披露的东方货币网站获得的,最后我们发现这些模型没有很好地预测趋势。此外,我们还用0.15美元以上的ROE过滤股票,净现金比率大于0.9。我们的结论是,根据公司刚刚发布的年度报告得出的财务指标,股票价格在披露后第二天的可预测性很弱,随机森林分类者对我们的测试设定的高度精确度为59.6%,最高精确度为0.56美元,而库存过滤则没有改善业绩。随机森林在所有这些符合某些工作结论的模型中表现最优。