Group-level emotion recognition (ER) is a growing research area, driven by the demand for assessing crowds of all sizes in both security applications and social media. This work extends earlier ER investigations, which focused on group-level ER in either single images or individual videos, by fully investigating group-level expression recognition on crowd videos. In this paper, we propose an effective deep feature-level fusion mechanism to model the spatial-temporal information in crowd videos. In our approach, fusion is performed in the deep feature domain by a generative probabilistic model, Non-Volume Preserving Fusion (NVPF), that models spatial relationships among features. Furthermore, we extend the proposed spatial NVPF to a spatial-temporal NVPF (TNVPF) that learns the temporal information between frames. To demonstrate the robustness and effectiveness of each component of the proposed approach, three experiments were conducted: (i) evaluation on the AffectNet database to benchmark the proposed EmoNet for facial expression recognition; (ii) evaluation on EmotiW2018 to benchmark the proposed deep feature-level fusion mechanism NVPF; and (iii) evaluation of the proposed TNVPF on a new Group-level Emotion on Crowd Videos (GECV) dataset composed of 627 videos collected from publicly available sources. The GECV dataset is a collection of videos containing crowds of people, where each video is labeled with emotion categories at three levels: individual faces, groups of people, and the entire video frame.
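To make the fusion idea concrete, the following is a minimal sketch (not the authors' code) of how a non-volume-preserving, RealNVP-style affine coupling block could fuse per-face deep features into a single group-level embedding. The class names, feature dimension, number of blocks, and the mean-pooling aggregation step are illustrative assumptions rather than details taken from the paper.

```python
# Minimal sketch, assuming per-face deep features from an EmoNet-like backbone;
# all layer sizes and the pooling step are assumptions, not the paper's design.
import torch
import torch.nn as nn


class AffineCoupling(nn.Module):
    """Non-volume-preserving (affine) coupling: half of the feature vector is
    transformed with a scale/shift predicted from the other half."""
    def __init__(self, dim: int, hidden: int = 256):
        super().__init__()
        self.half = dim // 2
        self.net = nn.Sequential(
            nn.Linear(self.half, hidden), nn.ReLU(),
            nn.Linear(hidden, 2 * (dim - self.half)),
        )

    def forward(self, x):
        x1, x2 = x[:, :self.half], x[:, self.half:]
        scale, shift = self.net(x1).chunk(2, dim=-1)
        scale = torch.tanh(scale)               # keep scales bounded
        y2 = x2 * torch.exp(scale) + shift      # non-volume-preserving map
        log_det = scale.sum(dim=-1)             # log |det Jacobian| of the transform
        return torch.cat([x1, y2], dim=-1), log_det


class SpatialNVPFusion(nn.Module):
    """Fuse N per-face deep features into one group-level embedding by
    mean-pooling, then passing the pooled vector through stacked couplings."""
    def __init__(self, feat_dim: int = 512, n_blocks: int = 4):
        super().__init__()
        self.blocks = nn.ModuleList([AffineCoupling(feat_dim) for _ in range(n_blocks)])

    def forward(self, face_feats):              # face_feats: (B, n_faces, feat_dim)
        z = face_feats.mean(dim=1)              # simple spatial aggregation (assumption)
        total_log_det = 0.0
        for blk in self.blocks:
            z, log_det = blk(z)
            total_log_det = total_log_det + log_det
        return z, total_log_det                 # fused embedding + flow log-det


if __name__ == "__main__":
    feats = torch.randn(2, 5, 512)              # 2 frames, 5 detected faces each
    fused, log_det = SpatialNVPFusion()(feats)
    print(fused.shape, log_det.shape)           # torch.Size([2, 512]) torch.Size([2])
```

A temporal extension in the spirit of TNVPF could, for example, apply the same coupling blocks to a sequence of per-frame fused embeddings, but the exact temporal modeling is described in the body of the paper rather than here.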