While deep learning (DL)-based video deraining methods have achieved significant success recently, they still exist two major drawbacks. Firstly, most of them do not sufficiently model the characteristics of rain layers of rainy videos. In fact, the rain layers exhibit strong physical properties (e.g., direction, scale and thickness) in spatial dimension and natural continuities in temporal dimension, and thus can be generally modelled by the spatial-temporal process in statistics. Secondly, current DL-based methods seriously depend on the labeled synthetic training data, whose rain types are always deviated from those in unlabeled real data. Such gap between synthetic and real data sets leads to poor performance when applying them in real scenarios. Against these issues, this paper proposes a new semi-supervised video deraining method, in which a dynamic rain generator is employed to fit the rain layer, expecting to better depict its insightful characteristics. Specifically, such dynamic generator consists of one emission model and one transition model to simultaneously encode the spatially physical structure and temporally continuous changes of rain streaks, respectively, which both are parameterized as deep neural networks (DNNs). Further more, different prior formats are designed for the labeled synthetic and unlabeled real data, so as to fully exploit the common knowledge underlying them. Last but not least, we also design a Monte Carlo EM algorithm to solve this model. Extensive experiments are conducted to verify the superiorities of the proposed semi-supervised deraining model.
翻译:虽然基于深度学习(DL)的视频脱线方法最近取得了显著成功,但目前基于DL的视频脱线方法仍然有两大缺陷。 首先,它们大多没有足够地模拟雨中视频雨层的特征。事实上,雨层在空间层面和时间层面的自然相联性方面表现出很强的物理特性(如方向、尺度和厚度),因此一般可以通过统计中的空间时空进程进行模拟。第二,目前基于DL的方法严重地依赖于贴标签的人工合成培训数据,这些数据的降雨类型总是不同于未贴标签的真实数据。合成数据和真实数据集之间的这种差距导致在真实情景中应用这些数据集时表现不佳。对于这些问题,本文提出了一种新的半超强的视频脱线方法,其中使用了动态雨水生成器来适应雨层,期望更好地描述其洞察的特征。具体地说,这种动态生成器包括一个排放模型和一个模型转换模型,以同时编码空间上的超流体结构,以及降雨量的时空连续变化,这两种模型都是作为深度精细的合成网络的参数,而不是作为深度的合成网络的精确度,而我们先前设计出了一个不同的模型。