Scale Attention for Learning Deep Face Representation: A Study Against Visual Scale Variation - 专知 Papers


September 19, 2022

Scale Attention for Learning Deep Face Representation: A Study Against Visual Scale Variation


Hailin Shi, Hang Du, Yibo Hu, Jun Wang, Dan Zeng, Ting Yao

Human face images usually appear across a wide range of visual scales. Existing face representations handle scale variation with a multi-scale scheme that assembles a finite series of predefined scales. Such a multi-shot scheme adds inference cost, and the predefined scales inevitably leave a gap from real data. Learning the scale parameters from data and using them for one-shot feature inference is a better-suited solution. To this end, we reform the conv layer by resorting to scale-space theory and obtain two benefits: 1) the conv layer learns a set of scales from the real data distribution, each of which is realized by a conv kernel; 2) the layer automatically highlights the feature at the proper channel and location corresponding to the scale of the input pattern and its presence. We then build hierarchical scale attention by stacking the reformed layers, yielding a novel architecture named SCale AttentioN Conv Neural Network (SCAN-CNN). We apply SCAN-CNN to the face recognition task and push the frontier of SOTA performance. The accuracy gain is more evident when the face images are blurry. Meanwhile, as a single-shot scheme, its inference is more efficient than multi-shot fusion. A set of tools is provided to ensure fast training of SCAN-CNN and zero increase in inference cost compared with the plain CNN.
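The abstract does not give the equations of the reformed layer, so the following is only a minimal sketch of the classical scale-space idea it refers to, not the SCAN-CNN layer itself. It filters an image at several Gaussian scales, computes a scale-normalized Laplacian response per scale, and turns those responses into per-location attention weights with a softmax. The sigmas here are fixed by hand, whereas the paper learns its scales from data; the names `gaussian_kernel`, `conv2d_same`, `scale_attention`, and the `temperature` parameter are hypothetical and chosen for illustration.

```python
import numpy as np

# 5-point discrete Laplacian stencil.
LAPLACIAN = np.array([[0.0, 1.0, 0.0],
                      [1.0, -4.0, 1.0],
                      [0.0, 1.0, 0.0]])


def gaussian_kernel(sigma, radius):
    """Return a normalized 2-D Gaussian kernel of size (2*radius + 1)."""
    ax = np.arange(-radius, radius + 1, dtype=np.float64)
    xx, yy = np.meshgrid(ax, ax)
    k = np.exp(-(xx ** 2 + yy ** 2) / (2.0 * sigma ** 2))
    return k / k.sum()


def conv2d_same(image, kernel):
    """'Same'-size 2-D correlation with zero padding (kernels here are symmetric)."""
    r = kernel.shape[0] // 2
    padded = np.pad(image, r, mode="constant")
    h, w = image.shape
    out = np.zeros((h, w), dtype=np.float64)
    for dy in range(kernel.shape[0]):
        for dx in range(kernel.shape[1]):
            out += kernel[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def scale_attention(image, sigmas, temperature=1.0):
    """Per-location softmax attention over a set of scales.

    Returns (weights, fused): weights has shape (S, H, W) and sums to 1
    over the scale axis; fused is the attention-weighted response (H, W).
    """
    responses = []
    for s in sigmas:
        radius = int(np.ceil(3 * s))
        smoothed = conv2d_same(image, gaussian_kernel(s, radius))
        log = conv2d_same(smoothed, LAPLACIAN)
        # Multiplying by sigma^2 makes responses comparable across scales.
        responses.append((s ** 2) * np.abs(log))
    responses = np.stack(responses)
    logits = responses / temperature
    logits = logits - logits.max(axis=0, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    weights = weights / weights.sum(axis=0, keepdims=True)
    fused = (weights * responses).sum(axis=0)
    return weights, fused


# Usage: a synthetic Gaussian blob whose size matches one of the scales.
yy, xx = np.mgrid[0:65, 0:65]
blob = np.exp(-((xx - 32) ** 2 + (yy - 32) ** 2) / (2.0 * 4.0 ** 2))
weights, fused = scale_attention(blob, sigmas=[1.0, 2.0, 4.0, 8.0])
```

In this toy setting the attention at the blob center favors the scale closest to the blob's own size, which is the single-pass behavior the abstract contrasts with running the network separately at several predefined scales.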


