The growth of video streaming has stretched the Internet to its limitation. In other words, the Internet was originally devised to connect a limited number of computers so that they can share network resources, so the Internet cannot handle a large amount of traffic at a time, which leads to network congestion. To overcome this, CDNs are built on top of the Internet as an overlay to efficiently store and swiftly disseminate contents to users by placing many servers and data centers around the globe. The topic of CDNs has been extensively studied in the last several decades. However, there is still a certain gap between theories in academia and current technologies in industry. In this paper, we take a close look at the design, implementation, solution, and performance of a CDN system by analyzing its raw log files. Specifically, its infrastructure and system design are first presented, and then we conduct a trace-based study to understand user access patterns, the sources of requests, system performance, and how such information can be used to improve the whole CDN system.
翻译:视频流的增长将互联网拉到了极限。换句话说,互联网的最初设计是为了连接数量有限的计算机,使他们能够共享网络资源,因此互联网无法同时处理大量交通,从而导致网络拥堵。为了克服这一点,CDN建在互联网的顶端,作为在全球放置许多服务器和数据中心以有效存储和迅速向用户传播内容的铺垫。过去几十年对CDN专题进行了广泛研究。然而,学术界的理论和当前工业技术之间仍然存在着一定的差距。在本文件中,我们仔细研究CDN系统的设计、实施、解决方案和性能,分析其原始日志文件。具体地说,首先介绍其基础设施和系统设计,然后进行追踪研究,以了解用户访问模式、请求来源、系统性能以及如何利用这类信息改进整个CDN系统。