The unpredictability and volatility of the stock market render it challenging to make a substantial profit using any generalised scheme. Many previous studies tried different techniques to build a machine learning model, which can make a significant profit in the US stock market by performing live trading. However, very few studies have focused on the importance of finding the best features for a particular trading period. Our top approach used the performance to narrow down the features from a total of 148 to about 30. Furthermore, the top 25 features were dynamically selected before each time training our machine learning model. It uses ensemble learning with four classifiers: Gaussian Naive Bayes, Decision Tree, Logistic Regression with L1 regularization, and Stochastic Gradient Descent, to decide whether to go long or short on a particular stock. Our best model performed daily trade between July 2011 and January 2019, generating 54.35% profit. Finally, our work showcased that mixtures of weighted classifiers perform better than any individual predictor of making trading decisions in the stock market.
翻译:股市的不可预测性和波动性使得利用任何通用计划获得大量利润成为一项挑战。许多先前的研究曾尝试过不同技术来建立机器学习模式,通过进行实战交易可以在美国股票市场中获取巨大利润。然而,很少有研究侧重于在特定交易期找到最佳特征的重要性。我们最顶尖的方法用业绩来将特征从总共148个缩小到大约30个。此外,在每次培训机器学习模式之前,最顶尖的25个特征都是动态选择的。它使用与四个分类者(高西亚纳维·贝耶斯、决策树、L1正规化的物流回归和Stochatic Gradient Empentle)的混合学习,以决定是长还是短。我们最好的模型在2011年7月至2019年1月之间进行了日常贸易,产生了54.35%的利润。最后,我们的工作展示了加权分类者的混合物比在股票市场上作出贸易决定的任何个人预测者都好。