Time-series forecasting is an important task in both academic and industry, which can be applied to solve many real forecasting problems like stock, water-supply, and sales predictions. In this paper, we study the case of retailers' sales forecasting on Tmall|the world's leading online B2C platform. By analyzing the data, we have two main observations, i.e., sales seasonality after we group different groups of retails and a Tweedie distribution after we transform the sales (target to forecast). Based on our observations, we design two mechanisms for sales forecasting, i.e., seasonality extraction and distribution transformation. First, we adopt Fourier decomposition to automatically extract the seasonalities for different categories of retailers, which can further be used as additional features for any established regression algorithms. Second, we propose to optimize the Tweedie loss of sales after logarithmic transformations. We apply these two mechanisms to classic regression models, i.e., neural network and Gradient Boosting Decision Tree, and the experimental results on Tmall dataset show that both mechanisms can significantly improve the forecasting results.
翻译:时间序列预测是学术和行业的一项重要任务,可用于解决股票、水供应和销售预测等许多实际预测问题。在本文中,我们研究了Tmall ⁇ the世界领先的在线B2C平台上的零售商销售预测案例。通过分析数据,我们有两个主要观察点,即我们将不同批次的零售分类之后的销售季节性以及我们变换销售(预测目标)之后的Tweedie分销。根据我们的观察,我们设计了两种销售预测机制,即季节性提取和分销转型。首先,我们采用了Fourier分解法,自动提取不同类别的零售商的季节性,这可以进一步用作任何既定回归算法的附加特征。第二,我们提议在对数转换后优化Tweedie的销售损失。我们将这些机制应用于典型的回归模型,即神经网络和重力推动决定树,以及Tmall数据集的实验结果显示,两种机制都能大大改进预测结果。