Learning how humans manipulate objects requires machines to acquire knowledge from two perspectives: one for understanding object affordances and the other for learning how humans interact with objects based on those affordances. Although both knowledge bases are crucial, we find that current databases lack a comprehensive awareness of them. In this work, we propose OakInk, a multimodal, richly annotated knowledge repository for visual and cognitive understanding of hand-object interactions. We first collect 1,800 common household objects and annotate their affordances to construct the first knowledge base: Oak. Given the affordances, we record rich human interactions with 100 selected objects in Oak. Finally, we transfer the interactions recorded on these 100 objects to their virtual counterparts through a novel method, Tink. The recorded and transferred hand-object interactions together constitute the second knowledge base: Ink. As a result, OakInk contains 50,000 distinct affordance-aware and intent-oriented hand-object interactions. We benchmark OakInk on pose estimation and grasp generation tasks. Moreover, we propose two practical applications of OakInk: intent-based interaction generation and handover generation. Our datasets and source code are publicly available at https://github.com/lixiny/OakInk.
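
To make the two-part structure concrete, the minimal Python sketch below illustrates how an affordance-aware, intent-oriented interaction record in Ink could link back to an object entry in Oak. All class and field names here (ObjectEntry, InteractionRecord, intent, hand_pose, is_transferred) are illustrative assumptions for exposition only, not the dataset's actual schema or toolkit API; refer to https://github.com/lixiny/OakInk for the released format.

    # Hypothetical sketch of the Oak/Ink linkage; names are assumptions, not the real API.
    from dataclasses import dataclass, field
    from typing import List

    @dataclass
    class ObjectEntry:
        """One of the 1,800 household objects in the Oak knowledge base."""
        object_id: str
        category: str                                          # e.g., "mug", "knife"
        affordances: List[str] = field(default_factory=list)   # e.g., ["hold", "pour"]

    @dataclass
    class InteractionRecord:
        """One of the ~50,000 hand-object interactions in the Ink knowledge base."""
        object_id: str              # links back to an ObjectEntry in Oak
        intent: str                 # e.g., "use", "hold", "hand over"
        hand_pose: List[float]      # hand pose parameters (assumed MANO-style axis-angle)
        is_transferred: bool = False  # True if produced by Tink rather than directly recorded

    # Usage: a recorded "hold" interaction on a mug, annotated with its intent.
    mug = ObjectEntry("oak_0001", "mug", affordances=["hold", "pour"])
    grasp = InteractionRecord("oak_0001", intent="hold", hand_pose=[0.0] * 48)
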