SGDraw: 使用以对象为导向的代表制的场景图绘制界面 (SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation) - 专知论文

会员服务 ·

0

图 · 可理解性 · 图像检索 · 表示 · Vision ·

2022 年 11 月 30 日

SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation

翻译：SGDraw: 使用以对象为导向的代表制的场景图绘制界面

Tianyu Zhang,Xusheng Du,Chia-Ming Chang,Xi Yang,Haoran Xie

from arxiv, 9 pages, 10 figures, video is https://youtu.be/acy0SNLfahg

Scene understanding is an essential and challenging task in computer vision. To provide the visually fundamental graphical structure of an image, the scene graph has received increased attention due to its powerful semantic representation. However, it is difficult to draw a proper scene graph for image retrieval, image generation, and multi-modal applications. The conventional scene graph annotation interface is not easy to use in image annotations, and the automatic scene graph generation approaches using deep neural networks are prone to generate redundant content while disregarding details. In this work, we propose SGDraw, a scene graph drawing interface using object-oriented scene graph representation to help users draw and edit scene graphs interactively. For the proposed object-oriented representation, we consider the objects, attributes, and relationships of objects as a structural unit. SGDraw provides a web-based scene graph annotation and generation tool for scene understanding applications. To verify the effectiveness of the proposed interface, we conducted a comparison study with the conventional tool and the user experience study. The results show that SGDraw can help generate scene graphs with richer details and describe the images more accurately than traditional bounding box annotations. We believe the proposed SGDraw can be useful in various vision tasks, such as image retrieval and generation.

翻译：为了提供图像的视觉基本图形结构,场景图由于具有很强的语义表达方式而得到越来越多的关注。然而,很难为图像检索、图像生成和多式应用程序绘制适当的场景图。常规场景图说明界面不容易用于图像说明,使用深神经网络的自动场景图生成方法容易产生多余的内容,而忽略细节。在这项工作中,我们提议SGDraw,一个使用对象导向的场景图绘制界面,用对象导向的场景图显示方式帮助用户以互动方式绘制和编辑场景图。对于拟议的面向目标的表示方式,我们把对象对象的物体、属性和关系视为结构单位。SGDraw为场景应用提供了一个基于网络的场景图说明和生成工具。为了核实拟议的界面的有效性,我们与传统工具及用户经验研究进行了比较研究。结果显示,SGDraw能够帮助生成更丰富细节的场景图,并更准确地描述图像,而不是传统的绑框说明。我们认为,拟议的SGDraw能够将所拟议的图像作为各种图像的检索作为有用的工具。

0

相关内容

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】图像分类必读开创性论文汇总

【推荐】图像分类必读开创性论文汇总

机器学习研究会

14+阅读 · 2017年8月15日

长链非编码RNA在细颗粒物（PM2.5）诱导肺癌发生的作用与机制

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA-TUSC7在胃癌中的抑癌作用及机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

表皮生长因子受体异构体EGFRvA对HNRNPF的调控和机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

神经内分泌肿瘤特异性多功能纳米分子探针NIRF-CCPM-Octreotide的研究

国家自然科学基金

0+阅读 · 2013年12月31日

卵巢癌细胞中血小板活化因子受体与表皮生长因子受体信号通路串话研究

国家自然科学基金

0+阅读 · 2012年12月31日

miR-1271低表达激活IGF1R/IRS1信号通路在胃癌细胞耐药中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

长链非编码RNA MEG3抑制NSCLC顺铂耐药机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

lnc-Oct4结合miR-145上调Oct4促进膀胱癌演进的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

缺血脑损伤中TRPM7/ChaK1介导神经元Annexin 1膜转位及分泌在小胶质细胞活化中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

γ#27688;基丁酸通过肿瘤抗原TRAK1(MGb2-Ag)调控胃癌细胞生长的机制

国家自然科学基金

0+阅读 · 2009年12月31日

SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections

Arxiv

0+阅读 · 2023年2月2日

Aerial Image Object Detection With Vision Transformer Detector (ViTDet)

Arxiv

0+阅读 · 2023年2月2日

Synthesizing Physical Character-Scene Interactions

Arxiv

0+阅读 · 2023年2月2日

On the Correctness of Automatic Differentiation for Neural Networks with Machine-Representable Parameters

Arxiv

0+阅读 · 2023年1月31日

A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation

Arxiv

12+阅读 · 2022年10月21日

Scene Graph Generation: A Comprehensive Survey

Arxiv

26+阅读 · 2022年1月3日

ImGAGN:Imbalanced Network Embedding via Generative Adversarial Graph Networks

Arxiv

14+阅读 · 2021年6月5日

Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text

Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text

Arxiv

10+阅读 · 2020年3月31日

Scene Text Detection and Recognition: The Deep Learning Era

Scene Text Detection and Recognition: The Deep Learning Era

Arxiv

27+阅读 · 2019年9月5日

Rotation-Sensitive Regression for Oriented Scene Text Detection

Arxiv

13+阅读 · 2018年3月14日

VIP会员

文章信息

相关主题

相关VIP内容

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

《跨领域人工智能感知：公众态度的比较性述评》最新65页

【NeurIPS2025】MaNGO：基于元学习的可适应图网络模拟器

《加拿大陆军顶层作战概念》2025最新33页

《人工智能安全治理框架》2.0版发布，90页pdf

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】图像分类必读开创性论文汇总

【推荐】图像分类必读开创性论文汇总

机器学习研究会

14+阅读 · 2017年8月15日

相关论文

SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections

Arxiv

0+阅读 · 2023年2月2日

Aerial Image Object Detection With Vision Transformer Detector (ViTDet)

Arxiv

0+阅读 · 2023年2月2日

Synthesizing Physical Character-Scene Interactions

Arxiv

0+阅读 · 2023年2月2日

On the Correctness of Automatic Differentiation for Neural Networks with Machine-Representable Parameters

Arxiv

0+阅读 · 2023年1月31日

A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation

Arxiv

12+阅读 · 2022年10月21日

Scene Graph Generation: A Comprehensive Survey

Arxiv

26+阅读 · 2022年1月3日

ImGAGN:Imbalanced Network Embedding via Generative Adversarial Graph Networks

Arxiv

14+阅读 · 2021年6月5日

Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text

Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text

Arxiv

10+阅读 · 2020年3月31日

Scene Text Detection and Recognition: The Deep Learning Era

Scene Text Detection and Recognition: The Deep Learning Era

Arxiv

27+阅读 · 2019年9月5日

Rotation-Sensitive Regression for Oriented Scene Text Detection

Arxiv

13+阅读 · 2018年3月14日

相关基金

长链非编码RNA在细颗粒物（PM2.5）诱导肺癌发生的作用与机制

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA-TUSC7在胃癌中的抑癌作用及机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

表皮生长因子受体异构体EGFRvA对HNRNPF的调控和机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

神经内分泌肿瘤特异性多功能纳米分子探针NIRF-CCPM-Octreotide的研究

国家自然科学基金

0+阅读 · 2013年12月31日

卵巢癌细胞中血小板活化因子受体与表皮生长因子受体信号通路串话研究

国家自然科学基金

0+阅读 · 2012年12月31日

miR-1271低表达激活IGF1R/IRS1信号通路在胃癌细胞耐药中的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

长链非编码RNA MEG3抑制NSCLC顺铂耐药机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

lnc-Oct4结合miR-145上调Oct4促进膀胱癌演进的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

缺血脑损伤中TRPM7/ChaK1介导神经元Annexin 1膜转位及分泌在小胶质细胞活化中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

γ#27688;基丁酸通过肿瘤抗原TRAK1(MGb2-Ag)调控胃癌细胞生长的机制

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员