Recent automated machine learning systems often use learning curves ranking models to inform decisions about when to stop unpromising trials and identify better model configurations. In this paper, we present a novel learning curve ranking model specifically tailored for ranking normalized entropy (NE) learning curves, which are commonly used in online advertising and recommendation systems. Our proposed model, self-Adaptive Curve Transformation augmented Relative curve Ranking (ACTR2), features an adaptive curve transformation layer that transforms raw lifetime NE curves into composite window NE curves with the window sizes adaptively optimized based on both the position on the learning curve and the curve's dynamics. We also introduce a novel differentiable indexing method for the proposed adaptive curve transformation, which allows gradients with respect to the discrete indices to flow freely through the curve transformation layer, enabling the learned window sizes to be updated flexibly during training. Additionally, we propose a pairwise curve ranking architecture that directly models the difference between the two learning curves and is better at capturing subtle changes in relative performance that may not be evident when modeling each curve individually as the existing approaches did. Our extensive experiments on a real-world NE curve dataset demonstrate the effectiveness of each key component of ACTR2 and its improved performance over the state-of-the-art.
翻译:最近的自动化机器学习系统经常使用学习曲线排名模型,为决定何时停止没有希望的试验和确定更好的模型配置提供参考。在本文中,我们提出了一个新的学习曲线排名模型,这是专门为分级的正常英特罗比(NE)学习曲线而设计的,在在线广告和建议系统中普遍使用。我们提议的模型,即自开发曲线变形,在培训期间使所学窗口大小能够灵活更新。此外,我们建议了一种双向曲线排序结构,直接模拟两个学习曲线之间的差异,并更好地捕捉在以学习曲线和曲线动态的每个曲线位置为模型时可能不明显的相对性能的细微变化。我们关于独立指数的梯度可以自由通过曲线变换层,使学习窗口大小在培训期间能够灵活地更新。此外,我们建议了一种双向曲线排序结构,直接模拟两个学习曲线之间的差异,并更好地捕捉到每个曲线的细微变化,而这些变化在分别模拟学习曲线和每个曲线的位置时可能并不明显。我们对于每个曲线的拟议适应曲线变式曲线变异性曲线的变化进行了新的指数方法的广泛实验。