In Quality-Diversity (QD) algorithms, which evolve a behaviourally diverse archive of high-performing solutions, the behaviour space is a difficult design choice that should be tailored to the target application. In QD meta-evolution, one evolves a population of QD algorithms to optimise the behaviour space based on an archive-level objective, the meta-fitness. This paper proposes an improved meta-evolution system such that (i) the database used to rapidly populate new archives is reformulated to prevent loss of quality-diversity; (ii) the linear transformation of base-features is generalised to a feature-map, a function of the base-features parametrised by the meta-genotype; and (iii) the mutation rate of the QD algorithm and the number of generations per meta-generation are controlled dynamically. Experiments on an 8-joint planar robot arm compare feature-maps (linear, non-linear, and feature-selection), parameter control strategies (static, endogenous, reinforcement learning, and annealing), and traditional MAP-Elites variants, for a total of 49 experimental conditions. Results reveal that non-linear and feature-selection feature-maps yield a 15-fold and 3-fold improvement in meta-fitness, respectively, over linear feature-maps. Reinforcement learning ranks among top parameter control methods. Finally, our approach allows the robot arm to recover a reach of over 80% for most damages and at least 60% for severe damages.
翻译:质量- 差异算法(QD) 质量- 差异算法(QD) 质量- 差异(QD) 算法(QD) 中,行为空间是一个难以选择的设计选择,应该根据目标应用量来定制。在 QD 元进化中,一个QD 元进化算法(QD 算法) 来优化行为空间,以基于归档级目标、元适合性为基础优化行为空间。本文建议改进元进化系统,以便(一) 用于迅速传播新档案的数据库重新配置,以防止质量多样性的丧失;(二) 基本功能的线性转换被概括为特征图(地貌图),这是由元基因类型所覆盖的基础性功能功能函数函数函数的功能;和(三) QD算法的突变速率和每代数是动态控制的。在8- 联合平流机器人臂比较特征图(线性、非线性、和特征选择) 参数控制战略(静态、 内部、 强化学习和肛界方法中最小化) 基准变法(Stenci- deal- amal- real- real- real- pal- real- pal- pal- pal- pal- pal- pal- pal- pal- pal- pal- pal- pal- pal- pal- pal- pal- pal- pal- pal- sal- sal- pal- sal- sal- sal- sal- sal- sal- slation- sal- sal- sal- sal- sal- sal- sal- sal- sal- lection- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- laction- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal-