Machine learning algorithms have enabled computers to predict things by learning from previous data. The data storage and processing power are increasing rapidly, thus increasing machine learning and Artificial intelligence applications. Much of the work is done to improve the accuracy of the models built in the past, with little research done to determine the computational costs of machine learning acquisitions. In this paper, I will proceed with this later research work and will make a performance comparison of multi-threaded machine learning clustering algorithms. I will be working on Linear Regression, Random Forest, and K-Nearest Neighbors to determine the performance characteristics of the algorithms as well as the computation costs to the obtained results. I will be benchmarking system hardware performance by running these multi-threaded algorithms to train and test the models on a dataset to note the differences in performance matrices of the algorithms. In the end, I will state the best performing algorithms concerning the performance efficiency of these algorithms on my system.
翻译:机器学习算法使计算机能够通过从先前的数据中学习来预测事物。数据存储和处理能力正在迅速增长,从而增加了机器学习和人工智能应用。许多工作都是为了提高过去所建模型的准确性,而没有做多少研究来确定机器学习获取的计算成本。在本文件中,我将着手进行这项后来的研究工作,并将对多轨机器学习组合算法进行性能比较。我将研究线性回归、随机森林和K-Nearest Near nearbors,以确定算法的性能特点以及计算所得结果的成本。我将是基准系统硬件性能,办法是运行这些多轨算法,在数据集上培训和测试模型,以注意到算法性矩阵的差异。最后,我将说明系统这些算法的性能效率方面的最佳算法。