In this paper, a multi-objective approach for the design of composite data-driven mathematical models is proposed. It allows automating the identification of graph-based heterogeneous pipelines that consist of different blocks: machine learning models, data preprocessing blocks, etc. The implemented approach is based on a parameter-free genetic algorithm (GA) for model design called GPComp@Free. It is developed to be part of automated machine learning solutions and to increase the efficiency of the modeling pipeline automation. A set of experiments was conducted to verify the correctness and efficiency of the proposed approach and substantiate the selected solutions. The experimental results confirm that a multi-objective approach to the model design allows achieving better diversity and quality of obtained models. The implemented approach is available as a part of the open-source AutoML framework FEDOT.
翻译:本文提出了设计综合数据驱动数学模型的多目标方法,可以自动确定由不同块块组成的基于图形的多式管道:机器学习模型、数据处理预处理区块等。实施的方法基于模型设计“GPComp@free”的无参数遗传算法(GA),发展成为自动机器学习解决方案的一部分,并提高编模管道自动化的效率。进行了一系列试验,以核实拟议方法的正确性和效率,并证实选定的解决办法。实验结果证实,模型设计的多目标方法能够使获得的模式更加多样化和质量更高。实施的方法作为开放源的Automal Automist FEDOT框架的一部分。