Learned image compression methods generally optimize a rate-distortion loss, trading off improvements in visual distortion for added bitrate. Increasingly, however, compressed imagery is used as an input to deep learning networks for various tasks such as classification, object detection, and superresolution. We propose a recognition-aware learned compression method, which optimizes a rate-distortion loss alongside a task-specific loss, jointly learning compression and recognition networks. We augment a hierarchical autoencoder-based compression network with an EfficientNet recognition model and use two hyperparameters to trade off between distortion, bitrate, and recognition performance. We characterize the classification accuracy of our proposed method as a function of bitrate and find that for low bitrates our method achieves as much as 26% higher recognition accuracy at equivalent bitrates compared to traditional methods such as Better Portable Graphics (BPG).
翻译:学习图像压缩方法通常优化率扭曲损失,将视觉扭曲的改进换成增加的比特速率。 但是,压缩图像越来越多地被用作诸如分类、对象探测和超分辨率等各种任务的深层学习网络的投入。 我们建议了一种承认-觉悟学习的压缩方法,该方法将率扭曲损失与特定任务损失、共同学习压缩和识别网络优化为最佳。 我们用高效的网络识别模型强化了基于自定义的等级压缩网络,并使用两个超强参数来交换扭曲、比特率和识别性能。 我们将我们拟议方法的分类准确性描述为比特率的函数,发现对于低比特率,我们的方法在等量的比特率比比更佳的图形(BPG)等传统方法达到26%更高的确认率。