项目名称: 云计算平台中大规模交互式服务长尾延迟消减关键技术研究
项目编号: No.61502451
项目类型: 青年科学基金项目
立项/批准年度: 2016
项目学科: 自动化技术、计算机技术
项目作者: 韩锐
作者单位: 中国科学院计算技术研究所
项目金额: 21万元
中文摘要: 大规模交互式服务通常将请求切分到多个组件上并行执行,因此请求延迟取决于组件长尾延迟(即最慢组件的延迟)。在云计算平台中,组件与并发批处理作业共享和竞争节点资源如高速缓存和I/O带宽而受到性能干扰,是造成组件性能差异性和变化性及高长尾延迟的主要因素。随着服务规模与复杂度的增加,高长尾延迟已成为制约其性能和收益提高的瓶颈,是云平台应用管理的关键技术难题。本课题将围绕云计算混合负载运行环境下大规模交互式服务长尾延迟问题,从性能干扰定量刻画和预测、组件层次细粒度延迟消减技术、及面向重点领域应用长尾延迟消减验证原型系统三个方面,开展系统的研究工作;重点研究长尾延迟消减的可预测性、精确控制性、高可用性三个科学问题。课题研究成果将有助于解决云平台中大规模分布式系统管理的关键技术挑战,提升我国云计算自主创新能力,并促进我国云计算快速发展。
中文关键词: 云计算;交互式服务;组件;性能干扰;长尾延迟
英文摘要: Large-scale interactive services usually divide requests into multiple sub-requests and distribute them to a large number of server components for parallel execution. Hence the tail latency (i.e. the slowest component's latency) of these components determines the overall service latency. On a cloud platform, each component shares and competes node resources such as caches and I/O bandwidths with its concurrently executing batch jobs, thus inevitably suffering from their performance interference, which is often regarded as the major reason for component latency heterogeneity and variability as well as high tail latency. With the increasing size and complexity of services, high tail latency has emerged as the bottleneck for service performance and profit improvement, and it is the key challenge to be addressed in cloud application management. This project will focus on the tail latency problem of large-scale interactive services within the context of mixed workloads running in the cloud. The project will conduct a systematic investigation from three aspects: the quantitative description and prediction of performance interference, fine-grained component-level tail latency mitigation techniques, and a prototype system designed to verify the proposed techniques for applications in the key fields. Our study aims at technologies that ensure three properties in tail latency mitigation: predictability, precise controllability and high practicability. This project will be helpful for solving the key challenges in managing large-scale distributed systems in the cloud, enhance China's capability of independent innovation and promote the rapid development of cloud computing.
英文关键词: Cloud computing;Interactive service;Server component;Performance interference;Tail latency