通过小型人员双重解剖中心进行自下至上 2D 粒子估计 (Bottom-Up 2D Pose Estimation via Dual Anatomical Centers for Small-Scale Persons) - 专知论文

会员服务 ·

0

自下而上 · 自顶向下 · 估计/估计量 · SOTA · 模型评估 ·

2022 年 8 月 25 日

Bottom-Up 2D Pose Estimation via Dual Anatomical Centers for Small-Scale Persons

翻译：通过小型人员双重解剖中心进行自下至上 2D 粒子估计

Yu Cheng,Yihao Ai,Bo Wang,Xinchao Wang,Robby T. Tan

from arxiv, 29 pages, 12 figures and 6 tables

In multi-person 2D pose estimation, the bottom-up methods simultaneously predict poses for all persons, and unlike the top-down methods, do not rely on human detection. However, the SOTA bottom-up methods' accuracy is still inferior compared to the existing top-down methods. This is due to the predicted human poses being regressed based on the inconsistent human bounding box center and the lack of human-scale normalization, leading to the predicted human poses being inaccurate and small-scale persons being missed. To push the envelope of the bottom-up pose estimation, we firstly propose multi-scale training to enhance the network to handle scale variation with single-scale testing, particularly for small-scale persons. Secondly, we introduce dual anatomical centers (i.e., head and body), where we can predict the human poses more accurately and reliably, especially for small-scale persons. Moreover, existing bottom-up methods use multi-scale testing to boost the accuracy of pose estimation at the price of multiple additional forward passes, which weakens the efficiency of bottom-up methods, the core strength compared to top-down methods. By contrast, our multi-scale training enables the model to predict high-quality poses in a single forward pass (i.e., single-scale testing). Our method achieves 38.4\% improvement on bounding box precision and 39.1\% improvement on bounding box recall over the state of the art (SOTA) on the challenging small-scale persons subset of COCO. For the human pose AP evaluation, we achieve a new SOTA (71.0 AP) on the COCO test-dev set with the single-scale testing. We also achieve the top performance (40.3 AP) on OCHuman dataset in cross-dataset evaluation.

翻译：在多人 2D 的估算中,自下而上的方法同时预测所有的人,与自上而下的方法不同,不依赖于人类检测。然而,SOTA自下而上方法的准确性仍然低于现有的自上而下方法。这是因为,根据人与人之间不协调的捆绑箱中心,预测人与人之间的关系会倒退,缺乏人与人之间的正常化,导致预测人与人之间的关系不准确,造成小规模人员错失。为了推推自上而上之的包包包包,我们首先提议进行多级培训,以加强网络,通过单级测试处理规模差异,特别是小规模人员。第二,我们采用双级解剖式中心(即头部和身体),我们可以在其中更准确和更可靠地预测人与人之间的关系。此外,现有的自下而上而上而下的测试方法使用多规模的CO来提高假设的准确性,从而降低自下而上而上之方法的效率,核心力量与自上而下的评估方法相比。相比之下,我们的多级培训也通过SO1 的高级测试,让我们的SLA-SB 的高级测试模式,实现SAL 高级测试。

0

相关内容

自下而上

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

复盐固溶-电解共沉积制备稀土镁合金的基础研究

国家自然科学基金

0+阅读 · 2015年12月31日

PDCD5对多发性骨髓瘤survivin表达的影响及分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

肝细胞肝癌中高表达的PRC1基因功能及其受CTCF调控的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

压敏效应对致密多孔介质微观孔隙结构及流体流动的影响机制

国家自然科学基金

0+阅读 · 2013年12月31日

Cofilin在Erucin诱导的乳腺癌细胞线粒体分裂和细胞凋亡中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

林果害虫春尺蠖性信息素的结构鉴定、合成及活性研究

国家自然科学基金

0+阅读 · 2012年12月31日

MiR-155/β-arrestin 2/GSK3β通路在Sca-1+心脏干细胞向心肌分化中的功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

TREM-1/DAP12/ NF-κB信号通路在6-姜烯酚抗动脉粥样硬化中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

遗传性LCAT缺陷症抗动脉粥样硬化发生的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

太阳能非成像聚光与多塔式阵列协同优化理论研究

国家自然科学基金

0+阅读 · 2011年12月31日

Text-driven Video Prediction

Arxiv

0+阅读 · 2022年10月6日

DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset for General Objects Based on Simulation

Arxiv

0+阅读 · 2022年10月6日

Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection

Arxiv

0+阅读 · 2022年10月5日

Temporally Consistent Video Transformer for Long-Term Video Prediction

Arxiv

0+阅读 · 2022年10月5日

IoU-Enhanced Attention for End-to-End Task Specific Object Detection

Arxiv

0+阅读 · 2022年10月5日

Multi-Camera Collaborative Depth Prediction via Consistent Structure Estimation

Arxiv

0+阅读 · 2022年10月5日

Dense Prediction Transformer for Scale Estimation in Monocular Visual Odometry

Arxiv

0+阅读 · 2022年10月4日

PlaneDepth: Plane-Based Self-Supervised Monocular Depth Estimation

Arxiv

0+阅读 · 2022年10月4日

Centroid Distance Keypoint Detector for Colored Point Clouds

Arxiv

0+阅读 · 2022年10月4日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】用于提升含优化层学习的算法与体系结构

【NeurIPS2025】有何不同于过去？基于自监督偏差学习的时空时间序列预测

超越决策优势：情报在创新与适应中的作用

量子计算发展态势研究报告（2025年）

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Text-driven Video Prediction

Arxiv

0+阅读 · 2022年10月6日

DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset for General Objects Based on Simulation

Arxiv

0+阅读 · 2022年10月6日

Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection

Arxiv

0+阅读 · 2022年10月5日

Temporally Consistent Video Transformer for Long-Term Video Prediction

Arxiv

0+阅读 · 2022年10月5日

IoU-Enhanced Attention for End-to-End Task Specific Object Detection

Arxiv

0+阅读 · 2022年10月5日

Multi-Camera Collaborative Depth Prediction via Consistent Structure Estimation

Arxiv

0+阅读 · 2022年10月5日

Dense Prediction Transformer for Scale Estimation in Monocular Visual Odometry

Arxiv

0+阅读 · 2022年10月4日

PlaneDepth: Plane-Based Self-Supervised Monocular Depth Estimation

Arxiv

0+阅读 · 2022年10月4日

Centroid Distance Keypoint Detector for Colored Point Clouds

Arxiv

0+阅读 · 2022年10月4日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

相关基金

复盐固溶-电解共沉积制备稀土镁合金的基础研究

国家自然科学基金

0+阅读 · 2015年12月31日

PDCD5对多发性骨髓瘤survivin表达的影响及分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

肝细胞肝癌中高表达的PRC1基因功能及其受CTCF调控的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

压敏效应对致密多孔介质微观孔隙结构及流体流动的影响机制

国家自然科学基金

0+阅读 · 2013年12月31日

Cofilin在Erucin诱导的乳腺癌细胞线粒体分裂和细胞凋亡中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

林果害虫春尺蠖性信息素的结构鉴定、合成及活性研究

国家自然科学基金

0+阅读 · 2012年12月31日

MiR-155/β-arrestin 2/GSK3β通路在Sca-1+心脏干细胞向心肌分化中的功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

TREM-1/DAP12/ NF-κB信号通路在6-姜烯酚抗动脉粥样硬化中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

遗传性LCAT缺陷症抗动脉粥样硬化发生的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

太阳能非成像聚光与多塔式阵列协同优化理论研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员