LIGHT: Joint Individual Building Extraction and Height Estimation from Satellite Images through a Unified Multitask Learning Network

Height estimation · Extraction · Multitask learning · Segmentation · Image interpretation

April 3, 2023

Yongqiang Mao, Xian Sun, Xingliang Huang, Kaiqiang Chen

Building extraction and height estimation are two important basic tasks in remote sensing image interpretation, with wide use in urban planning, real-world 3D construction, and other fields. Most existing research treats the two tasks independently, so height information cannot be fully used to improve the accuracy of building extraction, and vice versa. In this work, we combine the individuaL buIlding extraction and heiGHt estimation through a unified multiTask learning network (LIGHT) for the first time, which simultaneously outputs a height map, bounding boxes, and a segmentation mask map of buildings. Specifically, LIGHT consists of an instance segmentation branch and a height estimation branch. To effectively unify the multi-scale feature branches and alleviate feature spans between branches, we propose a Gated Cross Task Interaction (GCTI) module that efficiently performs feature interaction between branches. Experiments on the DFC2023 dataset show that LIGHT achieves superior performance, and that the GCTI module, with ResNet101 as the backbone, significantly improves multitask learning performance, by 2.8% AP50 and 6.5% delta1, respectively.
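The abstract reports height-estimation gains in delta1 without defining it. As a rough illustration only, the sketch below computes the threshold accuracy commonly used in depth and height estimation: the fraction of valid pixels whose ratio max(pred/gt, gt/pred) falls below 1.25. The 1.25 threshold, the rule of skipping non-positive values, and the function name `delta1` are assumptions for this example, not details taken from the paper.

```python
def delta1(pred, target, threshold=1.25):
    """Fraction of valid pixels with max(pred/gt, gt/pred) < threshold.

    Assumption: pixels where either value is non-positive are skipped,
    since the ratio is undefined there.
    """
    hits = 0
    valid = 0
    for p, t in zip(pred, target):
        if p <= 0 or t <= 0:
            continue  # skip pixels with no usable height
        valid += 1
        if max(p / t, t / p) < threshold:
            hits += 1
    return hits / valid if valid else float("nan")


# Hypothetical flattened height values in metres (not data from the paper).
predicted = [10.0, 20.0, 5.0, 30.0]
reference = [11.0, 30.0, 5.0, 0.0]
score = delta1(predicted, reference)
print(f"delta1 = {score:.3f}")
```

Under this definition, a higher delta1 means more pixels have a predicted height within 25% of the reference, which is the direction in which the reported 6.5% gain is an improvement.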
