Reconstructing natural videos from fMRI brain recordings is very challenging, for two main reasons: (i) As fMRI data acquisition is diffcult, we only have a limited amount of supervised samples, which is not enough to cover the huge space of natural videos; and (ii) The temporal resolution of fMRI recordings is much lower than the frame rate of natural videos. In this paper, we propose a selfsupervised approach for natural movie reconstruction. By employing cycle consistency over Encoding-Decoding natural videos, we can: (i) exploit the full framerate of the training videos, and not be limited only to clips that correspond to fMRI recordings; (ii) exploit massive amounts of external natural videos which the subjects never saw inside the fMRI machine. These enable increasing the applicable training data by several orders of magnitude, introducing natural video priors to the decoding network, as well as temporal coherence. Our approach signifcantly outperforms competing methods, since those train only on the limited supervised data. We further introduce a new and simple temporal prior of natural videos, which when folded into our fMRI decoder further allows us to reconstruct videos at a higher framerate (HFR) of up to x8 of the original fMRI sample rate.
翻译:重新构建来自FMRI大脑录音的自然视频非常困难,原因有两大:(一) 由于FMRI数据采集不易,我们只有有限的监管样本,不足以覆盖自然视频的巨大空间;和(二) FMRI记录的时间分辨率大大低于自然视频的框架率。在本文中,我们建议对自然电影的重建采取自我监督的方法。通过对编译自然视频进行编译的周期一致性,我们可以:(一) 利用培训视频的完整框架,而不限于与FMRI记录相对应的剪辑;(二) 利用大量外部自然视频,这些视频的主体从未在FMRI机内看到过。这些视频使得应用培训数据增加了几个数量级,引入了解译网络之前的自然视频,以及时间的一致性。我们的方法标志无疑超越了竞争方法,因为这些培训仅以有限的监管数据为基础。我们进一步引入了更新原始视频的新的和简单的时间段,当这些视频被折叠成了FMRI的原始框架时,可以将原始的FMRM格式复制到FRI 。