The elementary operation of cropping underpins nearly every computer vision system, ranging from data augmentation and translation invariance to computational photography and representation learning. This paper investigates the subtle traces introduced by this operation. For example, despite refinements to camera optics, lenses will leave behind certain clues, notably chromatic aberration and vignetting. Photographers also leave behind other clues relating to image aesthetics and scene composition. We study how to detect these traces, and investigate the impact that cropping has on the image distribution. While our aim is to dissect the fundamental impact of spatial crops, there are also a number of practical implications to our work, such as revealing faulty photojournalism and equipping neural network researchers with a better understanding of shortcut learning. Code is available at https://github.com/basilevh/dissecting-image-crops.
翻译:裁剪的基本操作几乎是几乎每一个计算机视觉系统的基础,从数据增强和翻译变化到计算摄影和代表性学习等,本文调查了这一操作带来的细微痕迹。例如,尽管对摄像光学进行了改进,但镜头会留下某些线索,特别是染色异常和挥发性;摄影师还留下与图像美学和场景构成有关的其他线索。我们研究如何探测这些痕迹,并调查裁剪对图像分布的影响。我们的目的是解析空间作物的根本影响,但也对我们的工作产生一些实际影响,例如揭示错误的摄影报道,让神经网络研究人员更好地了解捷径学习。守则可在https://github.com/basilievh/dicecting-image-crops查阅。