Due to the limited permissions for upgrading dualside (i.e., server-side and client-side) loss tolerance schemes from the perspective of CDN vendors in a multi-supplier market, modern large-scale live streaming services are still using the automatic-repeat-request (ARQ) based paradigm for loss recovery, which only requires server-side modifications. In this paper, we first conduct a large-scale measurement study with up to 50 million live streams. We find that loss shows dynamics and live streaming contains frequent on-off mode switching in the wild. We further find that the recovery latency, enlarged by the ubiquitous retransmission loss, is a critical factor affecting live streaming's client-side QoE (e.g., video freezing). We then propose an enhanced recovery mechanism called AutoRec, which can transform the disadvantages of on-off mode switching into an advantage for reducing loss recovery latency without any modifications on the client side. AutoRec allows users to customize overhead tolerance and recovery latency tolerance and adaptively adjusts strategies as the network environment changes to ensure that recovery latency meets user demands whenever possible while keeping overhead under control. We implement AutoRec upon QUIC and evaluate it via testbed and real-world commercial services deployments. The experimental results demonstrate the practicability and profitability of AutoRec.
翻译:由于多供应商市场中CDN厂商对升级双端(即服务器端与客户端)丢包容忍方案的权限有限,现代大规模直播流媒体服务仍采用基于自动重传请求(ARQ)的丢包恢复范式,该方案仅需服务器端修改。本文首先对多达5000万条直播流进行了大规模测量研究,发现实际环境中丢包呈现动态性且直播流媒体频繁发生启停模式切换。我们进一步发现,由普遍存在的重传丢包所扩大的恢复延迟是影响直播流媒体客户端体验质量(如视频卡顿)的关键因素。为此,我们提出一种增强型恢复机制AutoRec,该机制能够在不修改客户端的前提下,将启停模式切换的劣势转化为降低丢包恢复延迟的优势。AutoRec允许用户自定义开销容忍度与恢复延迟容忍度,并随网络环境变化自适应调整策略,从而在尽可能满足恢复延迟需求的同时控制开销。我们在QUIC协议上实现了AutoRec,并通过测试平台及真实商业服务部署进行评估。实验结果证明了AutoRec的实用性与效益性。