In data centers, the nature of the composite bursty traffic along with the small bandwidth-delay product and switch buffers lead to several congestion problems that are not handled well by traditional congestion control mechanisms such as TCP. Existing work try to address the problem by modifying TCP to suit the operational nature of data centers. This is practically feasible in private settings, however, in public environments, such modifications are prohibited. Therefore, in this work, we design simple switch-based queue management to deal with such congestion issues adequately. This approach entails no modification to the TCP sender and receiver algorithms which enables easy and seamless deployment in public data centers. We present a theoretical analysis to show the stability and effectiveness of the scheme. We also present, three different real implementations (as a Linux kernel module and as an added feature to OpenvSwitch) and give numerical results from both NS-2 simulation and experiments of real deployment in a small test-bed cluster to show its effectiveness in achieving high throughput overall, a good fairness and short flow completion times for delay-sensitive flows.
翻译:在数据中心,混合爆炸性交通的性质,加上小型带宽延迟产品和开关缓冲器,导致若干拥挤问题,而传统交通拥堵控制机制,如TCP等,处理得不好。现有工作试图通过修改TCP来解决这个问题,以适应数据中心的运作性质。然而,在私人环境中,这种修改实际上可行,但在公共环境中是禁止的。因此,在这项工作中,我们设计简单的基于开关的队列管理,以充分处理这种拥堵问题。这一方法不要求修改TCP发件人和接收器的算法,使公共数据中心能够方便无缝地部署。我们提出理论分析,以显示该计划的稳定性和有效性。我们还提出三种不同的实际执行(作为Linux内核模块和OpenvSwitch的附加特性),并提供NS-2模拟和在小型试床集群实际部署实验的数值结果,以显示其实现高吞量、对延迟敏感流动的公平性和短流完成时间的有效性。