Egocentric videos offer fine-grained information for high-fidelity modeling of human behaviors. Hands and the objects they interact with are a crucial cue for understanding a viewer's behaviors and intentions. We provide a labeled dataset of 11,243 egocentric images with per-pixel segmentation labels of hands and interacting objects across a diverse array of daily activities. Our dataset is the first to label detailed hand-object contact boundaries. We introduce a context-aware compositional data augmentation technique to adapt to out-of-distribution YouTube egocentric videos. We show that our robust hand-object segmentation model and dataset can serve as a foundational tool to boost or enable several downstream vision applications, including hand state classification, video activity recognition, 3D mesh reconstruction of hand-object interactions, and video inpainting of hand-object foregrounds in egocentric videos. Dataset and code are available at: https://github.com/owenzlz/EgoHOS
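To make the compositional augmentation idea concrete, below is a minimal sketch, assuming the augmentation composites labeled hand-object foregrounds from one egocentric frame onto a new background frame to synthesize additional training pairs. The function and variable names are hypothetical and are not taken from the released EgoHOS code.

```python
# Minimal sketch of compositional data augmentation (hypothetical; not the
# released EgoHOS implementation). It pastes a labeled hand-object foreground
# from a source frame onto a new egocentric background frame, producing an
# augmented (image, mask) training pair.
import numpy as np

def composite_hand_object(src_img, src_mask, bg_img):
    """Paste pixels where src_mask > 0 onto bg_img and carry the labels over.

    src_img : (H, W, 3) uint8 source frame containing hands/objects
    src_mask: (H, W)    uint8 per-pixel labels (0 = background)
    bg_img  : (H, W, 3) uint8 target background frame (same size assumed)
    """
    fg = src_mask > 0
    out_img = bg_img.copy()
    out_img[fg] = src_img[fg]          # copy foreground pixels onto new context
    out_mask = np.zeros_like(src_mask)
    out_mask[fg] = src_mask[fg]        # keep the original class labels
    return out_img, out_mask

if __name__ == "__main__":
    h, w = 240, 320
    rng = np.random.default_rng(0)
    src = rng.integers(0, 255, (h, w, 3), dtype=np.uint8)
    bg = rng.integers(0, 255, (h, w, 3), dtype=np.uint8)
    mask = np.zeros((h, w), dtype=np.uint8)
    mask[100:180, 120:220] = 1         # pretend this region is a labeled hand
    aug_img, aug_mask = composite_hand_object(src, mask, bg)
    print(aug_img.shape, aug_mask.max())
```

A context-aware variant would additionally choose background frames whose scene context matches the source activity; that selection step is omitted here.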