The explosion of short videos has dramatically reshaped the manners people socialize, yielding a new trend for daily sharing and access to the latest information. These rich video resources, on the one hand, benefited from the popularization of portable devices with cameras, but on the other, they can not be independent of the valuable editing work contributed by numerous video creators. In this paper, we investigate a novel and practical problem, namely audio beat matching (ABM), which aims to recommend the proper transition time stamps based on the background music. This technique helps to ease the labor-intensive work during video editing, saving energy for creators so that they can focus more on the creativity of video content. We formally define the ABM problem and its evaluation protocol. Meanwhile, a large-scale audio dataset, i.e., the AutoMatch with over 87k finely annotated background music, is presented to facilitate this newly opened research direction. To further lay solid foundations for the following study, we also propose a novel model termed BeatX to tackle this challenging task. Alongside, we creatively present the concept of label scope, which eliminates the data imbalance issues and assigns adaptive weights for the ground truth during the training procedure in one stop. Though plentiful short video platforms have flourished for a long time, the relevant research concerning this scenario is not sufficient, and to the best of our knowledge, AutoMatch is the first large-scale dataset to tackle the audio beat matching problem. We hope the released dataset and our competitive baseline can encourage more attention to this line of research. The dataset and codes will be made publicly available.
翻译:短视频的爆炸极大地改变了人们社交的方式,产生了每天分享和获取最新信息的新趋势。一方面,这些丰富的视频资源受益于使用相机的便携式装置的普及,但另一方面,它们不能独立于许多视频创作者提供的宝贵编辑工作。在本文中,我们调查了一个新颖而实际的问题,即声拍匹配(ABM),其目的是根据背景音乐建议适当的过渡时间邮票。这一技术有助于在视频编辑过程中简化劳动密集型工作,为创作者节省能量,以便他们能够更多地关注视频内容的创造力。我们正式定义了反弹道导弹问题及其评价协议。与此同时,大规模音频数据集,即拥有87k强度附加说明背景音乐的AutoMatch,是为了促进这一新开放的研究方向。为了进一步奠定下个研究的坚实基础,我们还提议了一个名为BeatX的新模型来应对这项具有挑战性的任务。此外,我们创造性地展示了标签范围的概念,以消除数据失衡问题,并赋予了这个数据库的创造力。在大规模地面研究过程中的适应力比重。</s>