Automatically tuning software configuration for optimizing a single performance attribute (e.g., minimizing latency) is not trivial, due to the nature of the configuration systems (e.g., complex landscape and expensive measurement). To deal with the problem, existing work has been focusing on developing various effective optimizers. However, a prominent issue that all these optimizers need to take care of is how to avoid the search being trapped in local optima -- a hard nut to crack for software configuration tuning due to its rugged and sparse landscape, and neighboring configurations tending to behave very differently. Overcoming such in an expensive measurement setting is even more challenging. In this paper, we take a different perspective to tackle this issue. Instead of focusing on improving the optimizer, we work on the level of optimization model. We do this by proposing a meta multi-objectivization model (MMO) that considers an auxiliary performance objective (e.g., throughput in addition to latency). What makes this model unique is that we do not optimize the auxiliary performance objective, but rather use it to make similarly-performing while different configurations less comparable (i.e. Pareto nondominated to each other), thus preventing the search from being trapped in local optima. Experiments on eight real-world software systems/environments with diverse performance attributes reveal that our MMO model is statistically more effective than state-of-the-art single-objective counterparts in overcoming local optima (up to 42% gain), while using as low as 24% of their measurements to achieve the same (or better) performance result.
翻译:优化单一性能属性(例如,最大限度地降低延迟度)自动调整软件配置以优化单一性能属性(例如,最大限度地降低延迟度)并非微不足道,因为配置系统的性质(例如,复杂地貌和昂贵的测量测量方法) 。为了解决这个问题,现有工作一直侧重于开发各种有效的优化剂。然而,所有这些优化者需要注意的一个突出问题是,如何避免搜索被困在本地opima -- -- 一种硬坚果,因为软件配置的调整会因其崎岖和分散的地貌而发生差异。在昂贵的测量环境中,克服这种情况甚至更具挑战性。在本文中,我们从不同的角度来解决这一问题。我们不是侧重于改进优化剂,而是在优化模型的层次上工作。我们这样做的方式是提出一个元性多感化模型,该模型考虑一个辅助性能目标(例如,除粘度外,通过吞吐量来调整软件模型),而这种模型的独特之处在于,我们并没有优化辅助性能目标,而是在昂贵的测量环境中进行类似性的工作,而不同性能的配置则比较不具有可比性(例如,在实际的24度上对等的系统进行不甚甚易地进行实地的实验性业绩。因此,而使每个系统都无法进行实地的实验性能。