The depth information is useful in many image processing applications. However, since taking a picture is a process of projection of a 3D scene onto a 2D imaging sensor, the depth information is embedded in the image. Extracting the depth information from the image is a challenging task. A guiding principle is that the level of blurriness due to defocus is related to the distance between the object and the focal plane. Based on this principle and the widely used assumption that Gaussian blur is a good model for defocus blur, we formulate the problem of estimating the spatially varying defocus blurriness as a Gaussian blur classification problem. We solved the problem by training a deep neural network to classify image patches into one of the 20 levels of blurriness. We have created a dataset of more than 500000 image patches of size 32x32 which are used to train and test several well-known network models. We find that MobileNetV2 is suitable for this application due to its low memory requirement and high accuracy. The trained model is used to determine the patch blurriness which is then refined by applying an iterative weighted guided filter. The result is a defocus map that carries the information of the degree of blurriness for each pixel. We compare the proposed method with state-of-the-art techniques and we demonstrate its successful applications in adaptive image enhancement, defocus magnification, and multi-focus image fusion.
翻译:深度信息在许多图像处理应用程序中非常有用。 但是,由于拍摄是一个将三维场景投射到二维成像传感器上的过程, 深度信息嵌入图像中。 从图像中提取深度信息是一项具有挑战性的任务。 一项指导原则是, 脱焦导致的模糊程度与对象和焦平面之间的距离有关。 基于这一原则以及广泛使用的高萨模糊是脱焦的好模型这一假设, 我们设计了一个问题, 将空间差异的模糊度作为高斯模糊的分类问题来估计。 我们通过训练一个深神经网络, 将图像补丁划为20层模糊度之一, 来解决问题。 我们创造了一个超过500,000个尺寸为32x32的图像补丁的数据集, 用于培训和测试几个众所周知的网络模型。 我们发现, 移动网V2 适合这一应用, 因为它的记忆度低要求和高度精确度, 我们使用经过训练的模型来确定模糊度, 并随后通过应用一个对精度的精确度的导航过滤器加以改进。 我们用一个清晰度的平面图的测量结果, 我们用一个成功的图状图的精确度分析, 显示我们所绘制的平面图的精确度的精确度, 。