Data-free quantization can potentially address data privacy and security concerns in model compression, and thus has been widely investigated. Recently, PSAQ-ViT designs a relative value metric, patch similarity, to generate data from pre-trained vision transformers (ViTs), achieving the first attempt at data-free quantization for ViTs. In this paper, we propose PSAQ-ViT V2, a more accurate and general data-free quantization framework for ViTs, built on top of PSAQ-ViT. More specifically, following the patch similarity metric in PSAQ-ViT, we introduce an adaptive teacher-student strategy, which facilitates the constant cyclic evolution of the generated samples and the quantized model (student) in a competitive and interactive fashion under the supervision of the full-precision model (teacher), thus significantly improving the accuracy of the quantized model. Moreover, without the auxiliary category guidance, we employ the task- and model-independent prior information, making the general-purpose scheme compatible with a broad range of vision tasks and models. Extensive experiments are conducted on various models on image classification, object detection, and semantic segmentation tasks, and PSAQ-ViT V2, with the naive quantization strategy and without access to real-world data, consistently achieves competitive results, showing potential as a powerful baseline on data-free quantization for ViTs. For instance, with Swin-S as the (backbone) model, 8-bit quantization reaches 82.13 top-1 accuracy on ImageNet, 50.9 box AP and 44.1 mask AP on COCO, and 47.2 mIoU on ADE20K. We hope that accurate and general PSAQ-ViT V2 can serve as a potential and practice solution in real-world applications involving sensitive data. Code will be released and merged at: https://github.com/zkkli/PSAQ-ViT.
翻译:无数据孔化有可能解决模型压缩中的数据隐私和安全关切,因此已经进行了广泛调查。最近,PSAQ-ViT设计了一个相对价值衡量标准(SSAQ-ViT)的适应性教师-静态精确度战略,通过经过事先训练的视觉变压器(ViTs)生成数据,首次尝试为ViTs实现数据无孔化。在本文件中,我们提议PSAQ-VIT V2,一个更准确和一般数据无孔化框架,建在PSAQ-VT顶端上。更具体地说,在PSAQ-ViT的补差相似度度指标之后,我们引入了一个适应性教师-静态精确度战略战略战略战略,在全精度模型(教师)的监督下,以具有竞争力的八KVQ-VK值模型(Si-VVT) 进行大规模实验,在VI-Vial-Vial-deal-deal-dealization 数据分类中,在VI-al-al-deal-deal-deal-deal-deal Salial Salial Salial Salial Sal Sal 2,在SAL ASal Sal Silation Silation Sild 2,在SAL 数据分类中,在SA 和SAL-de 2,在SA 和SI-al-al-deal-al-al-de 2,在SI-deal-deal-deal-de 2,在实际检测和SI-deal-de 2,在SI-al-de 2,在SI-de 2,在SI-de lad 和SI lad上,在实际数据分类中,在SI la-deal-de 2,在SI-de 2,可以上进行上进行上,在SI-al-al-deal-de-de-al-deal-al-al-al-de-de-al-al-deal-deal-al-al-al-al-al-al-al-al-al-al-al-al-al-al-al-al-al-al-al-al-al-al-al-al-al-de