Machine learning techniques for more efficient video compression and video enhancement have been developed thanks to breakthroughs in deep learning. The new techniques, considered as an advanced form of Artificial Intelligence (AI), bring previously unforeseen capabilities. However, they typically come in the form of resource-hungry black-boxes (overly complex with little transparency regarding the inner workings). Their application can therefore be unpredictable and generally unreliable for large-scale use (e.g. in live broadcast). The aim of this work is to understand and optimise learned models in video processing applications so systems that incorporate them can be used in a more trustworthy manner. In this context, the presented work introduces principles for simplification of learned models targeting improved transparency in implementing machine learning for video production and distribution applications. These principles are demonstrated on video compression examples, showing how bitrate savings and reduced complexity can be achieved by simplifying relevant deep learning models.
翻译:由于在深层学习方面的突破,开发了提高视频压缩和视频强化效率的机器学习技术,这些新技术被视为人工智能的先进形式,带来了以前未曾预见到的能力,但是,这些新技术通常以资源饥饿黑盒的形式出现(内部工作透明度低,非常复杂),因此其应用对于大规模使用(如现场直播)来说可能无法预测,而且一般不可靠,这项工作的目的是了解和优化视频处理应用程序中学习的模型,以便以更值得信赖的方式使用纳入这些模型的系统;在这方面,介绍的工作提出了简化学习模型的原则,目的是提高视频制作和分销应用机械学习的透明度,这些原则在视频压缩示例中展示,表明如何通过简化相关的深层学习模型实现比特率节约和降低复杂性。