In practice, it is very demanding and sometimes impossible to collect datasets of tagged data large enough to successfully train a machine learning model, and one possible solution to this problem is transfer learning. This study aims to assess how transferable are the features between different domains of time series data and under which conditions. The effects of transfer learning are observed in terms of predictive performance of the models and their convergence rate during training. In our experiment, we use reduced data sets of 1,500 and 9,000 data instances to mimic real world conditions. Using the same scaled-down datasets, we trained two sets of machine learning models: those that were trained with transfer learning and those that were trained from scratch. Four machine learning models were used for the experiment. Transfer of knowledge was performed within the same domain of application (seismology), as well as between mutually different domains of application (seismology, speech, medicine, finance). We observe the predictive performance of the models and the convergence rate during the training. In order to confirm the validity of the obtained results, we repeated the experiments seven times and applied statistical tests to confirm the significance of the results. The general conclusion of our study is that transfer learning is very likely to either increase or not negatively affect the predictive performance of the model or its convergence rate. The collected data is analysed in more details to determine which source and target domains are compatible for transfer of knowledge. We also analyse the effect of target dataset size and the selection of model and its hyperparameters on the effects of transfer learning.
翻译:在实践中,收集大量贴标签数据以成功培训机器学习模型非常要求很高,有时也不可能收集如此庞大的、足以成功培训机器学习模型的数据集,这个问题的一个可能解决办法是转移学习。本研究的目的是评估不同时间序列数据领域和条件的特征如何可转让。从模型的预测性能和训练期间的趋同率的角度观察转移学习的效果。在实验中,我们用1,500和9,000个数据组减少了数据组的预测性能来模仿现实世界状况。使用同样的缩放数据集,我们培训了两套机器学习模型:经过转让学习培训的模型和从零到零训练的模型。四个机器学习模型用于试验。知识的转让是在相同的应用领域(地震学)内进行的,以及在不同的应用领域(地震学、言论、医学、金融)之间进行。我们观察模型的预测性能和训练期间的趋同率。为了确认所获得的结果的有效性,我们重复了7次试验,并应用了两套机器学习模型测试,以证实结果的重要性。我们研究的总结论是,知识转移的目标范围可能从负面分析到数据的传播速度,而不是分析其数据转换速度。我们所收集到的数据分析。