The existing resource allocation policy for application instances in Kubernetes cannot dynamically adjust according to the requirement of business, which would cause an enormous waste of resources during fluctuations. Moreover, the emergence of new cloud services puts higher resource management requirements. This paper discusses horizontal POD resources management in Alibaba Cloud Container Services with a newly deployed AI algorithm framework named AHPA -- the adaptive horizontal pod auto-scaling system. Based on a robust decomposition forecasting algorithm and performance training model, AHPA offers an optimal pod number adjustment plan that could reduce POD resources and maintain business stability. Since being deployed in April 2021, this system has expanded to multiple customer scenarios, including logistics, social networks, AI audio and video, e-commerce, etc. Compared with the previous algorithms, AHPA solves the elastic lag problem, increasing CPU usage by 10% and reducing resource cost by more than 20%. In addition, AHPA can automatically perform flexible planning according to the predicted business volume without manual intervention, significantly saving operation and maintenance costs.
翻译:Kubernetes应用场的现有资源分配政策无法根据业务要求进行动态调整,这将在波动期间造成巨大的资源浪费。此外,新的云层服务的出现会提高资源管理要求。本文讨论了Alibaba Cloud Convention Services的横向POD资源管理,新部署的AI算法框架名为AHPA -- -- 适应性水平舱自动扩缩系统。根据强力分解预测算法和绩效培训模式,AHPA提供了最佳舱号调整计划,可以减少POD资源并维持业务稳定。自2021年4月部署以来,该系统已扩大到多种客户情况,包括后勤、社会网络、AI视听、电子商务等。与以前的算法相比,AHPA解决了弹性滞后问题,将CPU的使用率提高10%,将资源成本降低20%以上。此外,AHPA可以自动根据预测的经营量进行灵活规划,无需人工干预,大大节省运营和维护费用。</s>