For the challenging semantic image segmentation task the most efficient models have traditionally combined the structured modelling capabilities of Conditional Random Fields (CRFs) with the feature extraction power of CNNs. In more recent works however, CRF post-processing has fallen out of favour. We argue that this is mainly due to the slow training and inference speeds of CRFs, as well as the difficulty of learning the internal CRF parameters. To overcome both issues we propose to add the assumption of conditional independence to the framework of fully-connected CRFs. This allows us to reformulate the inference in terms of convolutions, which can be implemented highly efficiently on GPUs. Doing so speeds up inference and training by a factor of more then 100. All parameters of the convolutional CRFs can easily be optimized using backpropagation. To facilitating further CRF research we make our implementation publicly available. Please visit: https://github.com/MarvinTeichmann/ConvCRF
翻译:对于具有挑战性的语义图像分割任务,效率最高的模型传统上将条件随机场的结构建模能力与CNN的特征提取能力相结合。然而,在较近的工程中,通用报告格式后处理已经失去优势。我们争论说,这主要是由于通用报告格式的培训和推论速度缓慢,以及难以学习内部通用报告格式参数。为了克服这两个问题,我们提议在完全连接的通用报告格式框架之外加上有条件独立假设。这使我们能够重新描述在GPU上可以高效执行的演进中的推论。这样加快推论和训练的速度,以100倍以上的速度进行。利用反向调整,可以方便地优化革命性通用报告格式的所有参数。为了便利进一步进行通用报告格式研究,我们公开了执行情况。请访问:https://github.com/MarvinTeichmann/ConvCRF。请访问:https://gthub.com/MarvinTeichmann/CRF/CRF。