Autoencoder-based background reconstruction and foreground segmentation with background noise estimation

Even after decades of research, dynamic scene background reconstruction and foreground object segmentation are still considered open problems due to various challenges such as illumination changes, camera movements, or background noise caused by air turbulence or moving trees. In this paper, we propose to model the background of a video sequence as a low-dimensional manifold using an autoencoder, and to compare the reconstructed background provided by this autoencoder with the original image to compute the foreground/background segmentation masks. The main novelty of the proposed model is that the autoencoder is also trained to predict the background noise, which makes it possible to compute, for each frame, a pixel-dependent threshold used to perform the background/foreground segmentation. Although the proposed model does not use any temporal or motion information, it exceeds the state of the art for unsupervised background subtraction on the CDnet 2014 and LASIESTA datasets, with a significant improvement on videos where the camera is moving.
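The pixel-dependent thresholding described in the abstract can be illustrated with a minimal sketch. It assumes the autoencoder has already produced a reconstructed background and a per-pixel noise estimate for a grayscale frame. The function name `segment_foreground`, the scale factor `k`, and the rule "difference greater than `k` times the noise" are illustrative assumptions, not the paper's exact formulation. Plain Python lists are used to keep the example dependency-free.

```python
def segment_foreground(frame, background, noise, k=3.0):
    """Return a binary mask: 1 (foreground) where the frame differs from
    the reconstructed background by more than k times the local noise.

    frame, background, noise: 2-D lists of the same shape (grayscale).
    k: hypothetical scale factor, an assumption made for this sketch.
    """
    mask = []
    for f_row, b_row, n_row in zip(frame, background, noise):
        mask.append([
            1 if abs(f - b) > k * n else 0
            for f, b, n in zip(f_row, b_row, n_row)
        ])
    return mask


# Toy 2x2 frame: the noise map gives each pixel its own threshold.
frame      = [[10, 200], [12, 50]]
background = [[11, 100], [10, 48]]   # assumed autoencoder reconstruction
noise      = [[1.0, 5.0], [0.5, 2.0]]  # assumed predicted background noise

mask = segment_foreground(frame, background, noise)
print(mask)  # [[0, 1], [1, 0]]
```

In the bottom row, both pixels differ from the background by 2, but only the low-noise pixel is marked as foreground. That is the effect a per-pixel threshold is meant to achieve in regions with moving trees or air turbulence.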

Related content

Autoencoder

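An autoencoder is a type of artificial neural network used to learn efficient data encodings in an unsupervised manner. Its aim is to learn a representation (encoding) of a set of data, typically for dimensionality reduction, by training the network to ignore signal "noise". Alongside the reduction side, a reconstruction side is learned, in which the autoencoder tries to generate from the reduced encoding a representation as close as possible to its original input, hence its name. Several variants of the basic model exist, all aiming to force the learned representation of the input to have useful properties. Autoencoders are applied effectively to many problems, from face recognition to capturing the semantic meaning of words.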
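The reduce-then-reconstruct idea can be shown with a toy linear autoencoder that compresses 2-D points to a single number. This is only a sketch: the weight vector `W` is fixed by hand rather than learned, the encoder and decoder share that weight, and all function names are hypothetical. A real autoencoder would learn nonlinear encoder and decoder networks from data.

```python
def encode(x, w):
    """Reduce the input to a 1-D code: its projection onto direction w."""
    return sum(xi * wi for xi, wi in zip(x, w))


def decode(code, w):
    """Reconstruct an input-sized vector from the 1-D code."""
    return [code * wi for wi in w]


def reconstruction_error(x, w):
    """Squared distance between the input and its reconstruction."""
    x_hat = decode(encode(x, w), w)
    return sum((xi - hi) ** 2 for xi, hi in zip(x, x_hat))


# Hand-picked unit direction standing in for learned weights (assumption).
W = [0.6, 0.8]

on_manifold = [3.0, 4.0]    # lies along W, so the code captures it fully
off_manifold = [4.0, -3.0]  # orthogonal to W, so the code loses it entirely

print(reconstruction_error(on_manifold, W))   # approximately 0
print(reconstruction_error(off_manifold, W))  # approximately 25
```

Inputs that fit the low-dimensional structure are reconstructed almost exactly, while inputs that do not fit leave a large error. Background modelling with an autoencoder relies on the same contrast between well-reconstructed and poorly reconstructed content.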
