Benchmarking is commonly used in research fields, such as computer architecture design and machine learning, as a powerful paradigm for rigorously assessing, comparing, and developing novel technologies. However, the data centre networking community lacks a standard open-access benchmark. This is curtailing the community's understanding of existing systems and hindering the ability with which novel technologies can be developed, compared, and tested. We present TrafPy; an open-access framework for generating both realistic and custom data centre network traffic traces. TrafPy is compatible with any simulation, emulation, or experimentation environment, and can be used for standardised benchmarking and for investigating the properties and limitations of network systems such as schedulers, switches, routers, and resource managers. To demonstrate the efficacy of TrafPy, we use it to conduct a thorough investigation into the sensitivity of 4 canonical scheduling algorithms (shortest remaining processing time, fair share, first fit, and random) to varying traffic trace characteristics. We show how the fundamental scheduler performance insights revealed by these tests translate to 4 realistic data centre network types; University, Private Enterprise, Commercial Cloud, and Social Media Cloud. We then draw conclusions as to which types of scheduling policies are most suited to which types of network load conditions and traffic characteristics, leading to the possibility of application-informed decision making at the design stage and new dynamically adaptable scheduling policies. TrafPy is open-sourced via GitHub and all data associated with this manuscript via RDR.
翻译:基准通常用于研究领域,例如计算机建筑设计和机器学习,作为严格评估、比较和开发新技术的强大范例;然而,数据中心网络社区缺乏标准的开放基准,这削弱了社区对现有系统的理解,妨碍了社区对现有系统的理解,妨碍了开发、比较和测试新技术的能力。我们介绍了TrafPy;一个开放接入框架,以生成现实和自定制数据中心网络交通跟踪。TrafPy与任何模拟、模拟或实验环境相容,可用于标准化基准制定和调查诸如排期器、开关、路由器和资源管理者等网络系统的特点和局限性。为了展示TrafPy的功效,我们利用它彻底调查了四种卡通性算法的敏感性(即剩余处理时间、公平份额、第一合适和随机性),以产生不同的交通跟踪特征。我们展示了这些测试所揭示的基本调度员的公开业绩洞察力如何转化为4种现实的数据中心网络类型;大学、私营企业、商业云端、路由器和资源管理者等网络的属性和局限性。然后,我们用它来彻底地调查四类的轨道定位政策,从而得出了最可靠的地理结构定位,从而得出了最可靠的轨道定位和通过云层决定的路径。