As the COVID-19 pandemic rampages across the world, the demands of video conferencing surge. To this end, real-time portrait segmentation becomes a popular feature to replace backgrounds of conferencing participants. While feature-rich datasets, models and algorithms have been offered for segmentation that extract body postures from life scenes, portrait segmentation has yet not been well covered in a video conferencing context. To facilitate the progress in this field, we introduce an open-source solution named PP-HumanSeg. This work is the first to construct a large-scale video portrait dataset that contains 291 videos from 23 conference scenes with 14K fine-labeled frames and extensions to multi-camera teleconferencing. Furthermore, we propose a novel Semantic Connectivity-aware Learning (SCL) for semantic segmentation, which introduces a semantic connectivity-aware loss to improve the quality of segmentation results from the perspective of connectivity. And we propose an ultra-lightweight model with SCL for practical portrait segmentation, which achieves the best trade-off between IoU and the speed of inference. Extensive evaluations on our dataset demonstrate the superiority of SCL and our model. The source code is available at https://github.com/PaddlePaddle/PaddleSeg.


翻译:随着全世界COVID-19大流行大肆流行,视像会议需求激增。为此,实时肖像截图成为取代会议参与者背景的流行特征,取代会议参与者背景。虽然为从生活场景中提取身体姿势的分解提供了具有地貌丰富的数据集、模型和算法,但肖像截图尚未在电视会议背景下充分覆盖。为了便利这个领域的进展,我们引入了一个名为PP-HulmanSeg的开放源解决方案。这是首次构建一个大型视频肖像数据集,其中包含23个会议场景的291个视频,配有14K的精密标签框架和多摄像头电话会议扩展的扩展。此外,我们提议为静音分解提供一个新型的语义连通性-觉学习(SCL),从连通角度引入一个语义连通性-觉损失,以提高分解质量。我们还提议一个超轻量模型,由SCL进行实际肖像分解,实现IOU与PA的顶值交易,并实现多摄像的速。我们的数据源/PAB/Sdeldeal的高级评估。我们的数据源/SBebal/Clabs/C。

0
下载
关闭预览

相关内容

iOS 8 提供的应用间和应用跟系统的功能交互特性。
  • Today (iOS and OS X): widgets for the Today view of Notification Center
  • Share (iOS and OS X): post content to web services or share content with others
  • Actions (iOS and OS X): app extensions to view or manipulate inside another app
  • Photo Editing (iOS): edit a photo or video in Apple's Photos app with extensions from a third-party apps
  • Finder Sync (OS X): remote file storage in the Finder with support for Finder content annotation
  • Storage Provider (iOS): an interface between files inside an app and other apps on a user's device
  • Custom Keyboard (iOS): system-wide alternative keyboards

Source: iOS 8 Extensions: Apple’s Plan for a Powerful App Ecosystem
【经典书】主动学习理论,226页pdf,Theory of Active Learning
专知会员服务
126+阅读 · 2021年7月14日
【干货书】机器学习速查手册,135页pdf
专知会员服务
126+阅读 · 2020年11月20日
专知会员服务
40+阅读 · 2020年9月6日
【干货书】真实机器学习,264页pdf,Real-World Machine Learning
【电子书】机器学习实战(Machine Learning in Action),附PDF
专知会员服务
128+阅读 · 2019年11月25日
Hierarchically Structured Meta-learning
CreateAMind
26+阅读 · 2019年5月22日
Transferring Knowledge across Learning Processes
CreateAMind
28+阅读 · 2019年5月18日
Unsupervised Learning via Meta-Learning
CreateAMind
42+阅读 · 2019年1月3日
A Technical Overview of AI & ML in 2018 & Trends for 2019
待字闺中
17+阅读 · 2018年12月24日
二值多视角聚类:Binary Multi-View Clustering
我爱读PAMI
4+阅读 · 2018年6月24日
Hierarchical Imitation - Reinforcement Learning
CreateAMind
19+阅读 · 2018年5月25日
Hierarchical Disentangled Representations
CreateAMind
4+阅读 · 2018年4月15日
Auto-Encoding GAN
CreateAMind
7+阅读 · 2017年8月4日
Arxiv
7+阅读 · 2021年11月11日
Semantic Grouping Network for Video Captioning
Arxiv
3+阅读 · 2021年2月3日
Learning Blind Video Temporal Consistency
Arxiv
3+阅读 · 2018年8月1日
VIP会员
相关资讯
Top
微信扫码咨询专知VIP会员