多领域规范参考编码实现面部表情识别的高效迁移学习 (Multi-Domain Norm-referenced Encoding Enables Data Efficient Transfer Learning of Facial Expression Recognition) - 专知论文

会员服务 ·

0

面部表情识别 · 迁移学习 · 识别 · 多领域 · 准确率 ·

2023 年 4 月 5 日

Multi-Domain Norm-referenced Encoding Enables Data Efficient Transfer Learning of Facial Expression Recognition

翻译：多领域规范参考编码实现面部表情识别的高效迁移学习

Michael Stettler,Alexander Lappe,Nick Taubert,Martin Giese

People can innately recognize human facial expressions in unnatural forms, such as when depicted on the unusual faces drawn in cartoons or when applied to an animal's features. However, current machine learning algorithms struggle with out-of-domain transfer in facial expression recognition (FER). We propose a biologically-inspired mechanism for such transfer learning, which is based on norm-referenced encoding, where patterns are encoded in terms of difference vectors relative to a domain-specific reference vector. By incorporating domain-specific reference frames, we demonstrate high data efficiency in transfer learning across multiple domains. Our proposed architecture provides an explanation for how the human brain might innately recognize facial expressions on varying head shapes (humans, monkeys, and cartoon avatars) without extensive training. Norm-referenced encoding also allows the intensity of the expression to be read out directly from neural unit activity, similar to face-selective neurons in the brain. Our model achieves a classification accuracy of 92.15\% on the FERG dataset with extreme data efficiency. We train our proposed mechanism with only 12 images, including a single image of each class (facial expression) and one image per domain (avatar). In comparison, the authors of the FERG dataset achieved a classification accuracy of 89.02\% with their FaceExpr model, which was trained on 43,000 images.

翻译：人类能够自然地识别不自然的人脸表情，比如在卡通片中描绘的不同脸型或者应用到动物特征上。但是当前的机器学习算法在面部表情识别的跨领域迁移方面仍然存在问题。本文提出了一种基于生物灵感的机制来实现这种迁移学习，该机制基于规范参考编码，其中模式是相对于领域特定的参考向量而编码的差向量。通过结合领域特定的参考帧，我们实现了在多个领域的高效迁移学习。我们的提出的架构提供了人类大脑如何在不同头型（人类、猴子和卡通形象）上自然识别面部表情的解释，而无需进行广泛的训练。规范参考编码还允许直接从神经单元的活动中读取表情的强度，类似于人脑中面部选择性神经元的功能。我们的模型在FERG数据集上实现了92.15\%的分类准确率，并且具有极高的数据效率。我们的机制仅使用12张图像进行训练，包括每个类别（面部表情）和每个领域（卡通人物）的单张图像。相比之下，FERG数据集的作者使用了43,000张图像进行训练，并在其FaceExpr模型上实现了89.02\%的分类准确率。

0

相关内容

面部表情识别

面部表情识别

【CVPR 2022】基于双噪声标签的可见光-红外人再识别学习，Learning with Twin Noisy Labels for Visible-Infrared Person Re-Identification

【CVPR 2022】基于双噪声标签的可见光-红外人再识别学习，Learning with Twin Noisy Labels for Visible-Infrared Person Re-Identification

专知会员服务

14+阅读 · 2022年3月28日

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

【斯坦福&Facebook】生成式对抗变换器，Generative Adversarial Transformers

专知会员服务

21+阅读 · 2021年4月21日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

【斯坦福大学】具有共同注意力的对抗性跨域动作识别（Adversarial Cross-Domain Action Recognition with Co-Attention）

【斯坦福大学】具有共同注意力的对抗性跨域动作识别（Adversarial Cross-Domain Action Recognition with Co-Attention）

专知会员服务

38+阅读 · 2019年12月26日

【AAAI2020】用于视觉对话中深度视觉理解的自适应双向编码模型（DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue）, 中科院信工所于静等

【AAAI2020】用于视觉对话中深度视觉理解的自适应双向编码模型（DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue）, 中科院信工所于静等

专知会员服务

29+阅读 · 2019年11月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

领域自适应学习论文大列表

领域自适应学习论文大列表

专知

71+阅读 · 2019年3月2日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

教你用PyTorch实现“看图说话”（附代码、学习资源）

教你用PyTorch实现“看图说话”（附代码、学习资源）

数据派THU

12+阅读 · 2018年4月25日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

MoCoGAN 分解运动和内容的视频生成

MoCoGAN 分解运动和内容的视频生成

CreateAMind

18+阅读 · 2017年10月21日

Generative Adversarial Text to Image Synthesis论文解读

Generative Adversarial Text to Image Synthesis论文解读

统计学习与视觉计算组

13+阅读 · 2017年6月9日

基于语义网络的街区场景相似性研究

国家自然科学基金

4+阅读 · 2015年12月31日

语音感知的心理基础：上下文基频信息对声调感知的影响

国家自然科学基金

0+阅读 · 2014年12月31日

基于场景的电子医疗信息的记录机制与信息重放方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于视觉信息处理机制的启发式聚类算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于感知信息的语音增强及客观质量评估

国家自然科学基金

0+阅读 · 2012年12月31日

形象记忆过程中信息编码和提取的脑机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

e-learning中基于学业表情的情绪认知分析研究

国家自然科学基金

0+阅读 · 2009年12月31日

声纹表征模型及其漂移鲁棒性实现方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

选择性注意驱动的图像语义理解方法与计算模型研究

国家自然科学基金

0+阅读 · 2008年12月31日

可学习的脉冲耦合神经网络与基于视-听觉融合的人机交互方法研究

国家自然科学基金

0+阅读 · 2008年12月31日

AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment

Arxiv

0+阅读 · 2023年5月24日

Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic Disorders

Arxiv

0+阅读 · 2023年5月24日

On Learning to Summarize with Large Language Models as References

Arxiv

0+阅读 · 2023年5月23日

QFA2SR: Query-Free Adversarial Transfer Attacks to Speaker Recognition Systems

Arxiv

0+阅读 · 2023年5月23日

An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

Arxiv

0+阅读 · 2023年5月22日

Distract Your Attention: Multi-head Cross Attention Network for Facial Expression Recognition

Arxiv

0+阅读 · 2023年5月22日

Tune-Mode ConvBN Blocks For Efficient Transfer Learning

Arxiv

0+阅读 · 2023年5月19日

OntoZSL: Ontology-enhanced Zero-shot Learning

Arxiv

17+阅读 · 2021年2月15日

Scene Text Detection and Recognition: The Deep Learning Era

Scene Text Detection and Recognition: The Deep Learning Era

Arxiv

27+阅读 · 2019年9月5日

Adversarial Learning for Chinese NER from Crowd Annotations

Arxiv

15+阅读 · 2018年1月16日

VIP会员

文章信息

相关主题

面部表情识别

相关VIP内容

【CVPR 2022】基于双噪声标签的可见光-红外人再识别学习，Learning with Twin Noisy Labels for Visible-Infrared Person Re-Identification

【CVPR 2022】基于双噪声标签的可见光-红外人再识别学习，Learning with Twin Noisy Labels for Visible-Infrared Person Re-Identification

专知会员服务

14+阅读 · 2022年3月28日

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

【斯坦福&Facebook】生成式对抗变换器，Generative Adversarial Transformers

专知会员服务

21+阅读 · 2021年4月21日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

【斯坦福大学】具有共同注意力的对抗性跨域动作识别（Adversarial Cross-Domain Action Recognition with Co-Attention）

【斯坦福大学】具有共同注意力的对抗性跨域动作识别（Adversarial Cross-Domain Action Recognition with Co-Attention）

专知会员服务

38+阅读 · 2019年12月26日

【AAAI2020】用于视觉对话中深度视觉理解的自适应双向编码模型（DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue）, 中科院信工所于静等

【AAAI2020】用于视觉对话中深度视觉理解的自适应双向编码模型（DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue）, 中科院信工所于静等

专知会员服务

29+阅读 · 2019年11月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《主观概率约束下寻找可行系统及其军事应用》69页

《美政府问责局：多种挑战影响地面战车任务出勤率》2025最新130页

《战伤医疗训练：结合实体与数字资产的轻量化模拟器概念原型设计与评估》66页

俄乌战争启示：坦克战与不断演变的战斗形态

相关资讯

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

领域自适应学习论文大列表

领域自适应学习论文大列表

专知

71+阅读 · 2019年3月2日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

教你用PyTorch实现“看图说话”（附代码、学习资源）

教你用PyTorch实现“看图说话”（附代码、学习资源）

数据派THU

12+阅读 · 2018年4月25日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

MoCoGAN 分解运动和内容的视频生成

MoCoGAN 分解运动和内容的视频生成

CreateAMind

18+阅读 · 2017年10月21日

Generative Adversarial Text to Image Synthesis论文解读

Generative Adversarial Text to Image Synthesis论文解读

统计学习与视觉计算组

13+阅读 · 2017年6月9日

相关论文

AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment

Arxiv

0+阅读 · 2023年5月24日

Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic Disorders

Arxiv

0+阅读 · 2023年5月24日

On Learning to Summarize with Large Language Models as References

Arxiv

0+阅读 · 2023年5月23日

QFA2SR: Query-Free Adversarial Transfer Attacks to Speaker Recognition Systems

Arxiv

0+阅读 · 2023年5月23日

An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

Arxiv

0+阅读 · 2023年5月22日

Distract Your Attention: Multi-head Cross Attention Network for Facial Expression Recognition

Arxiv

0+阅读 · 2023年5月22日

Tune-Mode ConvBN Blocks For Efficient Transfer Learning

Arxiv

0+阅读 · 2023年5月19日

OntoZSL: Ontology-enhanced Zero-shot Learning

Arxiv

17+阅读 · 2021年2月15日

Scene Text Detection and Recognition: The Deep Learning Era

Scene Text Detection and Recognition: The Deep Learning Era

Arxiv

27+阅读 · 2019年9月5日

Adversarial Learning for Chinese NER from Crowd Annotations

Arxiv

15+阅读 · 2018年1月16日

相关基金

基于语义网络的街区场景相似性研究

国家自然科学基金

4+阅读 · 2015年12月31日

语音感知的心理基础：上下文基频信息对声调感知的影响

国家自然科学基金

0+阅读 · 2014年12月31日

基于场景的电子医疗信息的记录机制与信息重放方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于视觉信息处理机制的启发式聚类算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于感知信息的语音增强及客观质量评估

国家自然科学基金

0+阅读 · 2012年12月31日

形象记忆过程中信息编码和提取的脑机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

e-learning中基于学业表情的情绪认知分析研究

国家自然科学基金

0+阅读 · 2009年12月31日

声纹表征模型及其漂移鲁棒性实现方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

选择性注意驱动的图像语义理解方法与计算模型研究

国家自然科学基金

0+阅读 · 2008年12月31日

可学习的脉冲耦合神经网络与基于视-听觉融合的人机交互方法研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员